Benchmark analysis of GPT-5.1 by OpenAI across 14 tests.
Creator
OpenAI
Source Type
ProprietaryReasoning
ReasoningContext Window
400K
Overall Score
GPT-5.1 ranks #13 out of 88 models with an overall score of 76. It is created by OpenAI and features a 400K context window.
GPT-5.1 ranks #13 out of 88 models in knowledge and understanding benchmarks with an average score of 94. There are stronger options in this category.
GPT-5.1 ranks #13 out of 88 models in coding and programming benchmarks with an average score of 89. There are stronger options in this category.
GPT-5.1 ranks #13 out of 88 models in mathematics benchmarks with an average score of 97.1. There are stronger options in this category.
GPT-5.1 ranks #13 out of 88 models in reasoning and logic benchmarks with an average score of 92. There are stronger options in this category.
GPT-5.1 has a context window of 400K tokens, which determines how much text it can process in a single interaction.