Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Grok 4.1 wins overall with a score of 84 vs 59 (25 point difference).Grok 4.1 wins 4 out of 4 categories.
Claude 4 Sonnet
71.5
Grok 4.1
96
Claude 4 Sonnet
65
Grok 4.1
91
Claude 4 Sonnet
72
Grok 4.1
97.1
Claude 4 Sonnet
70
Grok 4.1
94
Grok 4.1 scores higher overall with 84 vs 59, a difference of 25 points across all benchmarks.
Grok 4.1 leads in knowledge tasks with an average score of 96 vs 71.5.
Grok 4.1 leads in coding with an average score of 91 vs 65.
Grok 4.1 leads in math with an average score of 97.1 vs 72.
Grok 4.1 leads in reasoning with an average score of 94 vs 70.