Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Gemini 2.5 Pro wins overall with a score of 65 vs 39 (26 point difference).Gemini 2.5 Pro wins 4 out of 4 categories.
Gemini 2.5 Pro
81.5
Llama 4 Behemoth
45.8
Gemini 2.5 Pro
75
Llama 4 Behemoth
40
Gemini 2.5 Pro
83
Llama 4 Behemoth
47
Gemini 2.5 Pro
80
Llama 4 Behemoth
45
Gemini 2.5 Pro scores higher overall with 65 vs 39, a difference of 26 points across all benchmarks.
Gemini 2.5 Pro leads in knowledge tasks with an average score of 81.5 vs 45.8.
Gemini 2.5 Pro leads in coding with an average score of 75 vs 40.
Gemini 2.5 Pro leads in math with an average score of 83 vs 47.
Gemini 2.5 Pro leads in reasoning with an average score of 80 vs 45.