GPT-5.1-Codex-Max vs GPT-5 (high)

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

GPT-5.1-Codex-Max wins overall with a score of 77 vs 72, a 5-point difference. It also wins 4 out of 4 categories.

Knowledge

Winner: GPT-5.1-Codex-Max

Benchmark     GPT-5.1-Codex-Max   GPT-5 (high)
Average       95                  90
MMLU          98                  93
GPQA          96                  91
SuperGPQA     94                  89
OpenBookQA    92                  87

Coding

Winner: GPT-5.1-Codex-Max

Benchmark     GPT-5.1-Codex-Max   GPT-5 (high)
Average       94                  85
HumanEval     94                  85

Mathematics

Winner: GPT-5.1-Codex-Max

Benchmark        GPT-5.1-Codex-Max   GPT-5 (high)
Average          97.1                94
AIME 2023        99                  95
AIME 2024        99                  97
AIME 2025        98                  96
HMMT Feb 2023    95                  91
HMMT Feb 2024    97                  93
HMMT Feb 2025    96                  92
BRUMO 2025       96                  94

Reasoning

Winner: GPT-5.1-Codex-Max

Benchmark     GPT-5.1-Codex-Max   GPT-5 (high)
Average       93                  88
SimpleQA      94                  89
MuSR          92                  87
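
The category scores appear to be plain arithmetic means of the per-benchmark scores listed above. The following Python sketch checks that assumption. The names `SCORES` and `category_averages` are hypothetical and chosen for illustration only, and rounding to one decimal place is assumed because the Mathematics score is shown as 97.1. The overall 77 vs 72 figures are not derived here, since the page does not state how they are computed.

```python
from statistics import mean

# Per-benchmark scores copied from the comparison above.
# Each tuple is (GPT-5.1-Codex-Max, GPT-5 (high)).
SCORES = {
    "Knowledge": {
        "MMLU": (98, 93),
        "GPQA": (96, 91),
        "SuperGPQA": (94, 89),
        "OpenBookQA": (92, 87),
    },
    "Coding": {
        "HumanEval": (94, 85),
    },
    "Mathematics": {
        "AIME 2023": (99, 95),
        "AIME 2024": (99, 97),
        "AIME 2025": (98, 96),
        "HMMT Feb 2023": (95, 91),
        "HMMT Feb 2024": (97, 93),
        "HMMT Feb 2025": (96, 92),
        "BRUMO 2025": (96, 94),
    },
    "Reasoning": {
        "SimpleQA": (94, 89),
        "MuSR": (92, 87),
    },
}


def category_averages(scores):
    """Return {category: (avg_codex_max, avg_gpt5_high)}, rounded to 1 decimal.

    Assumption: each category score is the unweighted mean of its benchmarks.
    """
    result = {}
    for category, benchmarks in scores.items():
        codex_max = mean(pair[0] for pair in benchmarks.values())
        gpt5_high = mean(pair[1] for pair in benchmarks.values())
        result[category] = (round(codex_max, 1), round(gpt5_high, 1))
    return result


AVERAGES = category_averages(SCORES)
for category, (codex_max, gpt5_high) in AVERAGES.items():
    print(f"{category}: {codex_max:g} vs {gpt5_high:g}")
```

Under that assumption the sketch reproduces the published category scores: 95 vs 90 for Knowledge, 94 vs 85 for Coding, 97.1 vs 94 for Mathematics, and 93 vs 88 for Reasoning.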

Frequently Asked Questions

Which is better, GPT-5.1-Codex-Max or GPT-5 (high)?

GPT-5.1-Codex-Max scores higher overall with 77 vs 72, a difference of 5 points across all benchmarks.

Which is better for knowledge tasks, GPT-5.1-Codex-Max or GPT-5 (high)?

GPT-5.1-Codex-Max leads in knowledge tasks with an average score of 95 vs 90.

Which is better for coding, GPT-5.1-Codex-Max or GPT-5 (high)?

GPT-5.1-Codex-Max leads in coding with an average score of 94 vs 85.

Which is better for math, GPT-5.1-Codex-Max or GPT-5 (high)?

GPT-5.1-Codex-Max leads in math with an average score of 97.1 vs 94.

Which is better for reasoning, GPT-5.1-Codex-Max or GPT-5 (high)?

GPT-5.1-Codex-Max leads in reasoning with an average score of 93 vs 88.