Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Gemini 3.1 Flash-Lite wins overall with a score of 53 vs 39 (14 point difference).Gemini 3.1 Flash-Lite wins 4 out of 4 categories.
Gemini 3.1 Flash-Lite
60.8
Llama 4 Behemoth
45.8
Gemini 3.1 Flash-Lite
55
Llama 4 Behemoth
40
Gemini 3.1 Flash-Lite
62
Llama 4 Behemoth
47
Gemini 3.1 Flash-Lite
59
Llama 4 Behemoth
45
Gemini 3.1 Flash-Lite scores higher overall with 53 vs 39, a difference of 14 points across all benchmarks.
Gemini 3.1 Flash-Lite leads in knowledge tasks with an average score of 60.8 vs 45.8.
Gemini 3.1 Flash-Lite leads in coding with an average score of 55 vs 40.
Gemini 3.1 Flash-Lite leads in math with an average score of 62 vs 47.
Gemini 3.1 Flash-Lite leads in reasoning with an average score of 59 vs 45.