Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Gemini 3 Pro Deep Think wins overall with a score of 81 vs 48 (33 point difference).Gemini 3 Pro Deep Think wins 4 out of 4 categories.
Gemini 3 Pro Deep Think
96
Llama 3 70B
56.5
Gemini 3 Pro Deep Think
91
Llama 3 70B
50
Gemini 3 Pro Deep Think
97.1
Llama 3 70B
57
Gemini 3 Pro Deep Think
94
Llama 3 70B
55
Gemini 3 Pro Deep Think scores higher overall with 81 vs 48, a difference of 33 points across all benchmarks.
Gemini 3 Pro Deep Think leads in knowledge tasks with an average score of 96 vs 56.5.
Gemini 3 Pro Deep Think leads in coding with an average score of 91 vs 50.
Gemini 3 Pro Deep Think leads in math with an average score of 97.1 vs 57.
Gemini 3 Pro Deep Think leads in reasoning with an average score of 94 vs 55.