Benchmark analysis of GPT-5.1-Codex-Max by OpenAI across 14 tests.
Creator: OpenAI
Source Type: Proprietary
Reasoning: Reasoning
Context Window: 400K
Overall Score
GPT-5.1-Codex-Max ranks #12 out of 88 models with an overall score of 77. It is created by OpenAI and features a 400K context window.
GPT-5.1-Codex-Max ranks #12 out of 88 models in knowledge and understanding benchmarks with an average score of 95. There are stronger options in this category.
GPT-5.1-Codex-Max ranks #3 out of 88 models in coding and programming benchmarks with an average score of 94. It is among the top performers in this category.
GPT-5.1-Codex-Max ranks #12 out of 88 models in mathematics benchmarks with an average score of 97.1. There are stronger options in this category.
GPT-5.1-Codex-Max ranks #12 out of 88 models in reasoning and logic benchmarks with an average score of 93. There are stronger options in this category.
GPT-5.1-Codex-Max has a context window of 400K tokens, which determines how much text it can process in a single interaction.