o3-pro vs Z-1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

o3-pro wins overall with a score of 68 vs 43 (a 25-point difference). o3-pro wins 4 out of 4 categories.

Knowledge

Benchmark      o3-pro    Z-1
Average        87.3      49.8
MMLU           88        52
GPQA           89        51
SuperGPQA      87        49
OpenBookQA     85        47

Coding

Benchmark      o3-pro    Z-1
Average        80        44
HumanEval      80        44

Mathematics

Benchmark        o3-pro    Z-1
Average          89        51
AIME 2023        90        52
AIME 2024        92        54
AIME 2025        91        53
HMMT Feb 2023    86        48
HMMT Feb 2024    88        50
HMMT Feb 2025    87        49
BRUMO 2025       89        51

Reasoning

Benchmark      o3-pro    Z-1
Average        85        49
SimpleQA       86        50
MuSR           84        48

Frequently Asked Questions

Which is better, o3-pro or Z-1?

o3-pro scores higher overall, 68 vs 43 (a 25-point difference), and leads in all four categories.

Which is better for knowledge tasks, o3-pro or Z-1?

o3-pro leads in knowledge tasks with an average score of 87.3 vs 49.8.

Which is better for coding, o3-pro or Z-1?

o3-pro leads in coding with an average score of 80 vs 44.

Which is better for math, o3-pro or Z-1?

o3-pro leads in math with an average score of 89 vs 51.

Which is better for reasoning, o3-pro or Z-1?

o3-pro leads in reasoning with an average score of 85 vs 49.
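
The category averages quoted in these answers can be cross-checked against the per-benchmark scores listed earlier. The sketch below is a minimal illustration, assuming each category average is a plain unweighted mean of its benchmarks; the page does not state its method, and the names `SCORES` and `category_averages` are hypothetical. Under that assumption the means come out to 87.25 vs 49.75 for knowledge, which matches the published 87.3 vs 49.8 if the page rounds half up. The overall 68 vs 43 score is not the mean of these category averages, so it presumably uses a weighting that is not given here.

```python
from statistics import mean

# Per-benchmark scores copied from the sections above, as (o3-pro, Z-1).
SCORES = {
    "Knowledge": {
        "MMLU": (88, 52),
        "GPQA": (89, 51),
        "SuperGPQA": (87, 49),
        "OpenBookQA": (85, 47),
    },
    "Coding": {
        "HumanEval": (80, 44),
    },
    "Mathematics": {
        "AIME 2023": (90, 52),
        "AIME 2024": (92, 54),
        "AIME 2025": (91, 53),
        "HMMT Feb 2023": (86, 48),
        "HMMT Feb 2024": (88, 50),
        "HMMT Feb 2025": (87, 49),
        "BRUMO 2025": (89, 51),
    },
    "Reasoning": {
        "SimpleQA": (86, 50),
        "MuSR": (84, 48),
    },
}


def category_averages(scores):
    """Return {category: (o3-pro mean, Z-1 mean)} using unweighted means.

    Assumption: every benchmark in a category counts equally.
    """
    result = {}
    for category, benchmarks in scores.items():
        o3_mean = mean(o3 for o3, _ in benchmarks.values())
        z1_mean = mean(z1 for _, z1 in benchmarks.values())
        result[category] = (o3_mean, z1_mean)
    return result


averages = category_averages(SCORES)
for category, (o3_mean, z1_mean) in averages.items():
    leader = "o3-pro" if o3_mean > z1_mean else "Z-1"
    # Two decimals are printed so the unrounded means stay visible.
    print(f"{category}: o3-pro {o3_mean:.2f} vs Z-1 {z1_mean:.2f} (leader: {leader})")
```

Two decimals are printed on purpose: Python's built-in rounding would turn 87.25 into 87.2 rather than the page's 87.3, so showing the unrounded mean avoids a misleading mismatch.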