Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Gemini 3.1 Pro wins overall with a score of 87 vs 39 (48 point difference).Gemini 3.1 Pro wins 4 out of 4 categories.
Gemini 3.1 Pro
96
Llama 4 Behemoth
45.8
Gemini 3.1 Pro
91
Llama 4 Behemoth
40
Gemini 3.1 Pro
97.1
Llama 4 Behemoth
47
Gemini 3.1 Pro
94
Llama 4 Behemoth
45
Gemini 3.1 Pro scores higher overall with 87 vs 39, a difference of 48 points across all benchmarks.
Gemini 3.1 Pro leads in knowledge tasks with an average score of 96 vs 45.8.
Gemini 3.1 Pro leads in coding with an average score of 91 vs 40.
Gemini 3.1 Pro leads in math with an average score of 97.1 vs 47.
Gemini 3.1 Pro leads in reasoning with an average score of 94 vs 45.