Watch the AI race unfold
Scrub the timeline to travel through 18 months of model releases. See who held the crown, who climbed the ranks, and how the benchmark landscape evolved.
Monthly Race Snapshot
Export or embed the live race view for the currently selected month.
Claude Fable 5
Anthropic
Releases this month
29 modelsClaude Mythos 5
Anthropic
Claude Fable 5
Anthropic
GLM-5.2
Z.AI
Qwen3.7 Plus
Alibaba
MiniMax M3
MiniMax
Nemotron 3 Ultra
NVIDIA
MAI-Thinking-1
Microsoft
Sakana Fugu Ultra
Sakana AI
Ornith-1.0-397B
DeepReinforce
Claude Sonnet 5
Anthropic
LongCat-2.0
Meituan
Ornith-1.0-35B
DeepReinforce
Ornith-1.0-9B
DeepReinforce
Gemma 4 12B
Kimi K2.7 Code
Moonshot AI
LFM2.5-230M
LiquidAI
Holo3.1-35B-A3B
H Company
Holo3.1-4B
H Company
Holo3.1-9B
H Company
GPT-5.6 Luna
OpenAI
GPT-5.6 Sol
OpenAI
GPT-5.6 Terra
OpenAI
Sakana Fugu
Sakana AI
LFM2.5-ColBERT-350M
LiquidAI
LFM2.5-Embedding-350M
LiquidAI
Holo3.1-0.8B
H Company
Holo3.1-35B-A3B-FP8
H Company
Holo3.1-35B-A3B-GGUF
H Company
Holo3.1-35B-A3B-NVFP4
H Company
Provider Race
Cumulative avg. top-3 score through Jun 2026Benchmark Health
How fresh are the benchmarks we use to score models? Green means the benchmark is actively separating models. Red means scores are bunching up or the benchmark is outdated.
Agentic
44 benchmarksCoding
34 benchmarksReasoning
23 benchmarksMultimodal
50 benchmarksKnowledge
30 benchmarksMultilingual
10 benchmarksInstruction Following
4 benchmarksMath
23 benchmarksWant deeper historical analysis?
Explore 21 months of Arena Elo ratings — crown changes, provider dominance, open-source gap tracking, and more.
LLM Leaderboard History