Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Grok 4.1 wins overall with a score of 84 vs 54 (30 point difference).Grok 4.1 wins 4 out of 4 categories.
Gemini 1.5 Pro
62.5
Grok 4.1
96
Gemini 1.5 Pro
56
Grok 4.1
91
Gemini 1.5 Pro
63
Grok 4.1
97.1
Gemini 1.5 Pro
61
Grok 4.1
94
Grok 4.1 scores higher overall with 84 vs 54, a difference of 30 points across all benchmarks.
Grok 4.1 leads in knowledge tasks with an average score of 96 vs 62.5.
Grok 4.1 leads in coding with an average score of 91 vs 56.
Grok 4.1 leads in math with an average score of 97.1 vs 63.
Grok 4.1 leads in reasoning with an average score of 94 vs 61.