Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Grok 4 wins overall with a score of 69 vs 68 (1 point difference).Grok 4 wins 0 out of 4 categories.
Grok 4
84.8
o3-pro
87.3
Grok 4
79
o3-pro
80
Grok 4
86.6
o3-pro
89
Grok 4
82
o3-pro
85
Grok 4 scores higher overall with 69 vs 68, a difference of 1 points across all benchmarks.
o3-pro leads in knowledge tasks with an average score of 87.3 vs 84.8.
o3-pro leads in coding with an average score of 80 vs 79.
o3-pro leads in math with an average score of 89 vs 86.6.
o3-pro leads in reasoning with an average score of 85 vs 82.