Claude 4 Sonnet vs Gemini 2.5 Flash

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Claude 4 Sonnet wins overall with a score of 59 vs 41 (18 point difference).Claude 4 Sonnet wins 4 out of 4 categories.

Knowledge

Claude 4 Sonnet

Claude 4 Sonnet

71.5

Gemini 2.5 Flash

47.8

73
MMLU
50
73
GPQA
49
71
SuperGPQA
47
69
OpenBookQA
45

Coding

Claude 4 Sonnet

Claude 4 Sonnet

65

Gemini 2.5 Flash

42

65
HumanEval
42

Mathematics

Claude 4 Sonnet

Claude 4 Sonnet

72

Gemini 2.5 Flash

49

73
AIME 2023
50
75
AIME 2024
52
74
AIME 2025
51
69
HMMT Feb 2023
46
71
HMMT Feb 2024
48
70
HMMT Feb 2025
47
72
BRUMO 2025
49

Reasoning

Claude 4 Sonnet

Claude 4 Sonnet

70

Gemini 2.5 Flash

47

71
SimpleQA
48
69
MuSR
46

Frequently Asked Questions

Which is better, Claude 4 Sonnet or Gemini 2.5 Flash?

Claude 4 Sonnet scores higher overall with 59 vs 41, a difference of 18 points across all benchmarks.

Which is better for knowledge tasks, Claude 4 Sonnet or Gemini 2.5 Flash?

Claude 4 Sonnet leads in knowledge tasks with an average score of 71.5 vs 47.8.

Which is better for coding, Claude 4 Sonnet or Gemini 2.5 Flash?

Claude 4 Sonnet leads in coding with an average score of 65 vs 42.

Which is better for math, Claude 4 Sonnet or Gemini 2.5 Flash?

Claude 4 Sonnet leads in math with an average score of 72 vs 49.

Which is better for reasoning, Claude 4 Sonnet or Gemini 2.5 Flash?

Claude 4 Sonnet leads in reasoning with an average score of 70 vs 47.