Qwen2.5-72B vs Z-1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Qwen2.5-72B wins overall with a score of 65 vs 43 (22 point difference).Qwen2.5-72B wins 4 out of 4 categories.

Knowledge

Qwen2.5-72B

Qwen2.5-72B

80.8

Z-1

49.8

83
MMLU
52
82
GPQA
51
80
SuperGPQA
49
78
OpenBookQA
47

Coding

Qwen2.5-72B

Qwen2.5-72B

75

Z-1

44

75
HumanEval
44

Mathematics

Qwen2.5-72B

Qwen2.5-72B

83

Z-1

51

84
AIME 2023
52
86
AIME 2024
54
85
AIME 2025
53
80
HMMT Feb 2023
48
82
HMMT Feb 2024
50
81
HMMT Feb 2025
49
83
BRUMO 2025
51

Reasoning

Qwen2.5-72B

Qwen2.5-72B

79

Z-1

49

80
SimpleQA
50
78
MuSR
48

Frequently Asked Questions

Which is better, Qwen2.5-72B or Z-1?

Qwen2.5-72B scores higher overall with 65 vs 43, a difference of 22 points across all benchmarks.

Which is better for knowledge tasks, Qwen2.5-72B or Z-1?

Qwen2.5-72B leads in knowledge tasks with an average score of 80.8 vs 49.8.

Which is better for coding, Qwen2.5-72B or Z-1?

Qwen2.5-72B leads in coding with an average score of 75 vs 44.

Which is better for math, Qwen2.5-72B or Z-1?

Qwen2.5-72B leads in math with an average score of 83 vs 51.

Which is better for reasoning, Qwen2.5-72B or Z-1?

Qwen2.5-72B leads in reasoning with an average score of 79 vs 49.