Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Gemini 3 Pro Deep Think wins overall with a score of 81 vs 58 (a 23-point difference). It also wins 4 out of 4 categories.
Knowledge: Gemini 3 Pro Deep Think 96, Llama 3.1 405B 68.5
Coding: Gemini 3 Pro Deep Think 91, Llama 3.1 405B 62
Math: Gemini 3 Pro Deep Think 97.1, Llama 3.1 405B 69
Reasoning: Gemini 3 Pro Deep Think 94, Llama 3.1 405B 67
Gemini 3 Pro Deep Think scores higher overall with 81 vs 58, a difference of 23 points across all benchmarks.
Gemini 3 Pro Deep Think leads in knowledge tasks with an average score of 96 vs 68.5.
Gemini 3 Pro Deep Think leads in coding with an average score of 91 vs 62.
Gemini 3 Pro Deep Think leads in math with an average score of 97.1 vs 69.
Gemini 3 Pro Deep Think leads in reasoning with an average score of 94 vs 67.
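The per-category margins and the "4 out of 4" count can be checked with a few lines of Python. This is a minimal sketch: the dictionary layout and the function names `category_margins` and `categories_won` are illustrative assumptions, and only the eight category averages are taken from the comparison. The overall 81 vs 58 score is not recomputed here, because the page does not say how it is aggregated from the individual benchmarks.

```python
# Category averages copied from the comparison above.
# Tuple order (an assumption of this sketch):
# (Gemini 3 Pro Deep Think, Llama 3.1 405B)
scores = {
    "knowledge": (96.0, 68.5),
    "coding": (91.0, 62.0),
    "math": (97.1, 69.0),
    "reasoning": (94.0, 67.0),
}


def category_margins(scores):
    """Return the first model's lead per category, rounded to 1 decimal."""
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


def categories_won(scores):
    """Count categories where the first model's average is strictly higher."""
    return sum(1 for a, b in scores.values() if a > b)


margins = category_margins(scores)
wins = categories_won(scores)

for name, margin in margins.items():
    print(f"{name}: +{margin} points")
print(f"categories won: {wins} of {len(scores)}")
```

Rounding to one decimal keeps the math margin readable, since subtracting 69 from 97.1 in floating point does not give exactly 28.1.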