Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Claude 4.1 Opus wins overall with a score of 61 vs 45 (a 16-point difference). Claude 4.1 Opus wins 4 out of 4 categories.
Knowledge: Claude 4.1 Opus 74.5, Nemotron-4 15B 51.8
Coding: Claude 4.1 Opus 68, Nemotron-4 15B 46
Math: Claude 4.1 Opus 75, Nemotron-4 15B 53
Reasoning: Claude 4.1 Opus 73, Nemotron-4 15B 51
Claude 4.1 Opus scores higher overall with 61 vs 45, a difference of 16 points across all benchmarks.
Claude 4.1 Opus leads in knowledge tasks with an average score of 74.5 vs 51.8.
Claude 4.1 Opus leads in coding with an average score of 68 vs 46.
Claude 4.1 Opus leads in math with an average score of 75 vs 53.
Claude 4.1 Opus leads in reasoning with an average score of 73 vs 51.
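The per-category margins and the "4 out of 4" win count can be checked with a few lines of arithmetic. The sketch below is illustrative only: the `scores` dictionary and the helper names `category_margins` and `count_wins` are assumptions made for this example, not part of any published tooling. The overall 61 vs 45 figures are left out because the source does not state how they are aggregated.

```python
# Category scores copied from the comparison above.
# The dictionary layout and helper names are hypothetical, chosen for illustration.
scores = {
    "knowledge": {"Claude 4.1 Opus": 74.5, "Nemotron-4 15B": 51.8},
    "coding": {"Claude 4.1 Opus": 68, "Nemotron-4 15B": 46},
    "math": {"Claude 4.1 Opus": 75, "Nemotron-4 15B": 53},
    "reasoning": {"Claude 4.1 Opus": 73, "Nemotron-4 15B": 51},
}


def category_margins(scores, a, b):
    """Return the score difference (a minus b) for each category."""
    # Rounding to one decimal avoids floating-point noise such as 22.700000000000003.
    return {cat: round(vals[a] - vals[b], 1) for cat, vals in scores.items()}


def count_wins(scores, a, b):
    """Count the categories in which model a scores strictly higher than model b."""
    return sum(1 for vals in scores.values() if vals[a] > vals[b])


margins = category_margins(scores, "Claude 4.1 Opus", "Nemotron-4 15B")
wins = count_wins(scores, "Claude 4.1 Opus", "Nemotron-4 15B")

for category, margin in margins.items():
    print(f"{category}: +{margin}")
print(f"Claude 4.1 Opus wins {wins} of {len(scores)} categories")
```

Under these figures the margin is 22.7 points in knowledge and 22 points in each of coding, math, and reasoning.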