Benchmark analysis of Claude 4.1 Opus Thinking by Anthropic across 14 tests.
Creator
Anthropic
Source Type
ProprietaryReasoning
ReasoningContext Window
200K
Overall Score
Claude 4.1 Opus Thinking ranks #79 out of 88 models with an overall score of 29. It is created by Anthropic and features a 200K context window.
Claude 4.1 Opus Thinking ranks #79 out of 88 models in knowledge and understanding benchmarks with an average score of 35.8. There are stronger options in this category.
Claude 4.1 Opus Thinking ranks #79 out of 88 models in coding and programming benchmarks with an average score of 30. There are stronger options in this category.
Claude 4.1 Opus Thinking ranks #79 out of 88 models in mathematics benchmarks with an average score of 37. There are stronger options in this category.
Claude 4.1 Opus Thinking ranks #79 out of 88 models in reasoning and logic benchmarks with an average score of 35. There are stronger options in this category.
Claude 4.1 Opus Thinking has a context window of 200K tokens, which determines how much text it can process in a single interaction.