Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Gemini 3 Pro Deep Think wins overall with a score of 81 vs 76 (5 point difference).Gemini 3 Pro Deep Think wins 3 out of 4 categories.
Gemini 3 Pro Deep Think
96
GPT-5.1
94
Gemini 3 Pro Deep Think
91
GPT-5.1
89
Gemini 3 Pro Deep Think
97.1
GPT-5.1
97.1
Gemini 3 Pro Deep Think
94
GPT-5.1
92
Gemini 3 Pro Deep Think scores higher overall with 81 vs 76, a difference of 5 points across all benchmarks.
Gemini 3 Pro Deep Think leads in knowledge tasks with an average score of 96 vs 94.
Gemini 3 Pro Deep Think leads in coding with an average score of 91 vs 89.
Gemini 3 Pro Deep Think and GPT-5.1 are tied for math with average scores of 97.1.
Gemini 3 Pro Deep Think leads in reasoning with an average score of 94 vs 92.