Claude Haiku 4.5 vs Qwen2.5-1M

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Qwen2.5-1M wins overall with a score of 66 vs 57 (9 point difference).Qwen2.5-1M wins 4 out of 4 categories.

Knowledge

Qwen2.5-1M

Claude Haiku 4.5

65.8

Qwen2.5-1M

81.8

68
MMLU
84
67
GPQA
83
65
SuperGPQA
81
63
OpenBookQA
79

Coding

Qwen2.5-1M

Claude Haiku 4.5

60

Qwen2.5-1M

76

60
HumanEval
76

Mathematics

Qwen2.5-1M

Claude Haiku 4.5

67

Qwen2.5-1M

84

68
AIME 2023
85
70
AIME 2024
87
69
AIME 2025
86
64
HMMT Feb 2023
81
66
HMMT Feb 2024
83
65
HMMT Feb 2025
82
67
BRUMO 2025
84

Reasoning

Qwen2.5-1M

Claude Haiku 4.5

64

Qwen2.5-1M

80

65
SimpleQA
81
63
MuSR
79

Frequently Asked Questions

Which is better, Claude Haiku 4.5 or Qwen2.5-1M?

Qwen2.5-1M scores higher overall with 66 vs 57, a difference of 9 points across all benchmarks.

Which is better for knowledge tasks, Claude Haiku 4.5 or Qwen2.5-1M?

Qwen2.5-1M leads in knowledge tasks with an average score of 81.8 vs 65.8.

Which is better for coding, Claude Haiku 4.5 or Qwen2.5-1M?

Qwen2.5-1M leads in coding with an average score of 76 vs 60.

Which is better for math, Claude Haiku 4.5 or Qwen2.5-1M?

Qwen2.5-1M leads in math with an average score of 84 vs 67.

Which is better for reasoning, Claude Haiku 4.5 or Qwen2.5-1M?

Qwen2.5-1M leads in reasoning with an average score of 80 vs 64.