The AI Race
Watch the AI race unfold
Scrub the timeline to travel through 18 months of model releases. See who held the crown, who climbed the ranks, and how the benchmark landscape evolved.
Apr 20265 releases
Crown Holder
Gemma 4 31B
73/ 100
New champion this month!
Runner Up
Qwen3.6 Plus
Alibaba
69/ 100
Releases this month
5 modelsProvider Race
Cumulative avg. top-3 score through Apr 2026Benchmark Health
How fresh are the benchmarks we use to score models? Green means the benchmark is actively separating models. Red means scores are bunching up or the benchmark is outdated.
Agentic
24 benchmarks24 current
Coding
13 benchmarks11 current1 refreshing1 stale1 saturated
Reasoning
11 benchmarks9 current2 stale1 saturated
Multimodal
34 benchmarks32 current2 refreshing
Knowledge
13 benchmarks8 current3 refreshing2 stale1 saturated
Multilingual
7 benchmarks6 current1 stale
Instruction Following
2 benchmarks1 current1 stale
Math
14 benchmarks9 current2 refreshing3 stale
Want deeper historical analysis?
Explore 21 months of Arena Elo ratings — crown changes, provider dominance, open-source gap tracking, and more.
LLM Leaderboard History