Claude 4 Sonnet vs Gemini 2.5 Pro

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Gemini 2.5 Pro wins overall with a score of 65 vs 59 (a 6-point difference). Gemini 2.5 Pro wins 4 out of 4 categories.

Knowledge

Winner: Gemini 2.5 Pro
Category average: Claude 4 Sonnet 71.5, Gemini 2.5 Pro 81.5

Benchmark     Claude 4 Sonnet   Gemini 2.5 Pro
MMLU          73                83
GPQA          73                83
SuperGPQA     71                81
OpenBookQA    69                79

Coding

Winner: Gemini 2.5 Pro
Category average: Claude 4 Sonnet 65, Gemini 2.5 Pro 75

Benchmark     Claude 4 Sonnet   Gemini 2.5 Pro
HumanEval     65                75

Mathematics

Winner: Gemini 2.5 Pro
Category average: Claude 4 Sonnet 72, Gemini 2.5 Pro 83

Benchmark        Claude 4 Sonnet   Gemini 2.5 Pro
AIME 2023        73                84
AIME 2024        75                86
AIME 2025        74                85
HMMT Feb 2023    69                80
HMMT Feb 2024    71                82
HMMT Feb 2025    70                81
BRUMO 2025       72                83

Reasoning

Winner: Gemini 2.5 Pro
Category average: Claude 4 Sonnet 70, Gemini 2.5 Pro 80

Benchmark     Claude 4 Sonnet   Gemini 2.5 Pro
SimpleQA      71                81
MuSR          69                79
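
The category averages reported in each section appear to be plain unweighted means of the per-benchmark scores. The Python sketch below checks that arithmetic under that assumption; the page does not state its averaging method, and the names SCORES and category_averages are illustrative, not part of any published tool. It does not attempt to reproduce the overall 65 vs 59 score, because the page does not say how that figure is computed.

```python
from statistics import mean

# Per-benchmark scores copied from the category sections:
# each value is (Claude 4 Sonnet, Gemini 2.5 Pro).
SCORES = {
    "Knowledge": {
        "MMLU": (73, 83),
        "GPQA": (73, 83),
        "SuperGPQA": (71, 81),
        "OpenBookQA": (69, 79),
    },
    "Coding": {
        "HumanEval": (65, 75),
    },
    "Mathematics": {
        "AIME 2023": (73, 84),
        "AIME 2024": (75, 86),
        "AIME 2025": (74, 85),
        "HMMT Feb 2023": (69, 80),
        "HMMT Feb 2024": (71, 82),
        "HMMT Feb 2025": (70, 81),
        "BRUMO 2025": (72, 83),
    },
    "Reasoning": {
        "SimpleQA": (71, 81),
        "MuSR": (69, 79),
    },
}


def category_averages(scores):
    """Return {category: (claude_mean, gemini_mean)}.

    Assumes an unweighted mean over the benchmarks in each category.
    """
    averages = {}
    for category, benchmarks in scores.items():
        claude = mean(c for c, _ in benchmarks.values())
        gemini = mean(g for _, g in benchmarks.values())
        averages[category] = (claude, gemini)
    return averages


averages = category_averages(SCORES)

# Count the categories in which Gemini 2.5 Pro has the higher average.
gemini_wins = sum(1 for c, g in averages.values() if g > c)

for category, (c, g) in averages.items():
    print(f"{category}: Claude 4 Sonnet {c:g} vs Gemini 2.5 Pro {g:g}")
print(f"Gemini 2.5 Pro leads {gemini_wins} of {len(averages)} categories")
```

Under this assumption the computed means match the category averages reported on the page (71.5 vs 81.5, 65 vs 75, 72 vs 83, 70 vs 80), and Gemini 2.5 Pro leads all four categories.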

Frequently Asked Questions

Which is better, Claude 4 Sonnet or Gemini 2.5 Pro?

Gemini 2.5 Pro scores higher overall with 65 vs 59, a difference of 6 points across all benchmarks.

Which is better for knowledge tasks, Claude 4 Sonnet or Gemini 2.5 Pro?

Gemini 2.5 Pro leads in knowledge tasks with an average score of 81.5 vs 71.5.

Which is better for coding, Claude 4 Sonnet or Gemini 2.5 Pro?

Gemini 2.5 Pro leads in coding with an average score of 75 vs 65.

Which is better for math, Claude 4 Sonnet or Gemini 2.5 Pro?

Gemini 2.5 Pro leads in math with an average score of 83 vs 72.

Which is better for reasoning, Claude 4 Sonnet or Gemini 2.5 Pro?

Gemini 2.5 Pro leads in reasoning with an average score of 80 vs 70.