Benchmark analysis of Grok 4 by xAI across 14 tests.
Creator
xAI
Source Type
ProprietaryReasoning
Non-ReasoningContext Window
128K
Overall Score
Grok 4 ranks #22 out of 88 models with an overall score of 69. It is created by xAI and features a 128K context window.
Grok 4 ranks #26 out of 88 models in knowledge and understanding benchmarks with an average score of 84.8. There are stronger options in this category.
Grok 4 ranks #26 out of 88 models in coding and programming benchmarks with an average score of 79. There are stronger options in this category.
Grok 4 ranks #26 out of 88 models in mathematics benchmarks with an average score of 86.6. There are stronger options in this category.
Grok 4 ranks #26 out of 88 models in reasoning and logic benchmarks with an average score of 82. There are stronger options in this category.
Grok 4 has a context window of 128K tokens, which determines how much text it can process in a single interaction.