Benchmark analysis of Grok 4.1 by xAI across 14 tests.
Creator
xAI
Source Type
Proprietary
Reasoning
Non-Reasoning
Context Window
128K
Overall Score
Grok 4.1 ranks #5 out of 88 models with an overall score of 84. It was developed by xAI and has a 128K context window.
Grok 4.1 ranks #5 out of 88 models in knowledge and understanding benchmarks with an average score of 96. It is among the top performers in this category.
Grok 4.1 ranks #8 out of 88 models in coding and programming benchmarks with an average score of 91. It is among the top performers in this category.
Grok 4.1 ranks #5 out of 88 models in mathematics benchmarks with an average score of 97.1. It is among the top performers in this category.
Grok 4.1 ranks #5 out of 88 models in reasoning and logic benchmarks with an average score of 94. It is among the top performers in this category.
Grok 4.1 has a context window of 128K tokens, which determines how much text it can process in a single interaction.
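To make the context window limit concrete, here is a minimal sketch of a pre-flight check that estimates whether a prompt would fit. It rests on assumptions that are not from the benchmark page: that 128K means 128,000 tokens (it could also mean 131,072), that text averages roughly 4 characters per token, and that some room is reserved for the reply. The names `estimate_tokens` and `fits_in_context` are hypothetical helpers, not part of any xAI API; an accurate count would need the model's actual tokenizer.

```python
# Rough check of whether a prompt fits in a 128K-token context window.
# Assumptions (not from the source): 128K means 128,000 tokens, and
# text averages about 4 characters per token.

CONTEXT_WINDOW_TOKENS = 128_000
CHARS_PER_TOKEN = 4  # heuristic only; real tokenizers vary


def estimate_tokens(text: str) -> int:
    """Estimate the token count from character length, rounding up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(text: str, reserved_for_output: int = 4_000) -> bool:
    """Return True if the estimated prompt size leaves room for the reply."""
    return estimate_tokens(text) + reserved_for_output <= CONTEXT_WINDOW_TOKENS
```

Under these assumptions, a document of about 400,000 characters would be estimated at 100,000 tokens and would fit, while one of about 500,000 characters would not.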