BenchLM recommendation

Best xAI Grok Models in 2026

Data verified July 20, 2026

As of July 20, 2026, the top model in best xai grok models on the BenchLM leaderboard is Grok 4.5 with a score of 76.7.

Last verified: July 20, 2026

All xAI Grok models ranked by benchmark performance.

Unless noted otherwise, ranking surfaces on this page use BenchLM's provisional leaderboard lane rather than the stricter sourced-only verified leaderboard.

Bottom line: Grok 4.1 is xAI's best — strong on multimodal (98) and knowledge (95) with perfect multilingual. Grok 4.20 adds stronger coding and instruction following.

Grok 4.5 leads this ranking with a score of 76.7, followed by Grok 4.3 (65.1) and Grok 4.1 Fast (Reasoning) (60.5). There is a significant gap between the leading models and the rest of the field.

All models in this ranking are proprietary. No open-weight alternatives are available for this category.

This ranking is based on provisional overall weighted scores across BenchLM.ai's scoring formula tracked by BenchLM.ai. For detailed model profiles, click any model name below. To compare two specific models head-to-head, use the "vs #" links.

1Closed

Grok 4.5

xAI · 500K

76.7BenchAlign v5

2Closed

Grok 4.3

xAI · 1M

65.1BenchAlign v5

3Closed

Grok 4.1 Fast (Reasoning)

xAI · 2M

60.5BenchAlign v5

What changed

Grok 4.1 leads xAI's lineup with perfect multilingual and strong multimodal (98).

Grok 4.20 reasoning variant — leads instruction following (98) and coding (80).

Grok 4.1 Fast speed-optimized with strong math (94) and reasoning (89).

How to choose

Best xAI model overall?

Grok 4.1 — strongest across all categories

Coding and structured output?

Grok 4.20 — best instruction following and coding

Low latency?

Grok 4.1 Fast — speed-optimized variant

Full Rankings (10 models)

1

Grok 4.5

xAI·Proprietary·500K

76.7

BenchAlign v5

vs #2

2

Grok 4.3

xAI·Proprietary·1M

65.1

BenchAlign v5

vs #3

3

Grok 4.1 Fast (Reasoning)

xAI·Proprietary·2M

60.5

BenchAlign v5

vs #4

4

Grok 4

xAI·Proprietary·128K

60.4

BenchAlign v5

vs #5

5

Grok 4.1

xAI·Proprietary·1M

60

BenchAlign v5

vs #6

6

Grok 4 Fast (Reasoning)

xAI·Proprietary·2M

56.6

BenchAlign v5

vs #7

7

Grok 4.20

xAI·Proprietary·2M

54.7

BenchAlign v5

vs #8

8

Grok 4.1 Fast

xAI·Proprietary·1M

51.3

BenchAlign v5

vs #9

9

Grok 3 [Beta]

xAI·Proprietary·128K

40.4

BenchAlign v5

vs #10

10

Grok Code Fast 1

xAI·Proprietary·256K

38.6

BenchAlign v5

Key Takeaways

The top model is Grok 4.5 by xAI with a BenchAlign v5 score of 76.7 and Supported evidence.

10 models are included in this ranking.

Score in Context

What these scores mean

Models are ranked by the same overall BenchLM score used across all leaderboards. Comparing within xAI's lineup helps identify which model fits your use case and budget.

Known limitations

This page only shows xAI models. Cross-provider comparison requires the overall or category-specific leaderboards. Newer models may have limited benchmark coverage initially.

Explore More

Last updated: July 20, 2026

Choose a model with this week’s evidence

Join 2,000+ readers for ranking moves, pricing changes, and the claims that still need proof.

One email each week. Unsubscribe anytime.