Unified Model Leaderboard
Benchmarks, pricing, runtime signals, and context window in one table. Filter state syncs to the URL, so every view is shareable. Provisional-ranked mode also counts benchmark evidence that is non-generated but whose source has not been verified.
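One way to picture the URL sync is as a round trip between a filter dictionary and a query string. The sketch below is illustrative only: the parameter names (`license`, `type`, `provider`), the helper names, and the `example.com` address are hypothetical, since the page's actual URL scheme is not documented here.

```python
from urllib.parse import parse_qs, urlencode, urlsplit, urlunsplit


def filters_to_url(base_url: str, filters: dict[str, list[str]]) -> str:
    """Encode the active filters into the query string of base_url."""
    # Sort keys and values so the same filter state always yields the same URL.
    pairs = [(key, value) for key in sorted(filters) for value in sorted(filters[key])]
    parts = urlsplit(base_url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(pairs), ""))


def url_to_filters(url: str) -> dict[str, list[str]]:
    """Recover the filter state from a shared URL."""
    query = urlsplit(url).query
    return {key: sorted(values) for key, values in parse_qs(query).items()}


# Hypothetical filter state: open-weight reasoning models from two providers.
state = {"license": ["Open"], "type": ["Reasoning"], "provider": ["Z.AI", "Alibaba"]}
shared = filters_to_url("https://example.com/leaderboard", state)
restored = url_to_filters(shared)
```

Sorting before encoding keeps the URL canonical, so two users who pick the same filters in a different order end up with the same link.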
1 Gemini 3.1 Pro | Google | Closed | Current | Standard | 1M | $1.25 / $5.00 | 109 | 29.71s | 85 | 78 | 77 | 88 | 89 | 81 | 94 | 90 | 59 | 1492.63 |
2 Claude Opus 4.6 | Anthropic | Closed | Current | Standard | 1M | $15.00 / $75.00 | 40 | 1.78s | 84 | 82 | 75 | 82 | 85 | 78 | 95 | 90 | 79 | 1496.61 |
3 GPT-5.3 Codex | OpenAI | Closed | Current | Reasoning | 400K | $2.50 / $10.00 | 79 | 88.26s | ~84 | 81 | 73 | 93 | 91 | 82 | 93 | 88 | 98 | 1416 |
4 Grok 4.1 | xAI | Closed | Superseded | Standard | 1M | $3.00 / $15.00 | N/A | N/A | ~83 | 74 | 69 | 91 | 93 | 81 | 93 | 85 | 93 | 1460.98 |
5 GPT-5.4 | OpenAI | Closed | Current | Reasoning | 1.05M | $2.50 / $15.00 | 74 | 151.79s | 81 | 70 | 74 | 88 | 88 | 83 | 95 | 90 | 81 | 1465.79 |
6 Claude Mythos Preview | Anthropic | Closed | Current | Reasoning | 1M | $25.00 / $125.00 | N/A | N/A | 81 | 78 | 78 | — | 93 | 75 | 93 | 80 | — | — |
7 Gemini 3 Pro Deep Think | Google | Closed | Current | Reasoning | 2M | N/A | N/A | N/A | ~81 | 76 | 66 | 82 | 95 | 76 | 87 | 89 | 96 | 1486.39 |
8 | Z.AI | Open | Current | Reasoning | 200K | $0.00 / $0.00 | N/A | N/A | ~81 | 81 | 66 | 87 | 79 | 74 | 86 | 84 | 94 | 1455.62 |
9 Claude Sonnet 4.6 | Anthropic | Closed | Current | Standard | 200K | $3.00 / $15.00 | 44 | 1.48s | 80 | 76 | 70 | 78 | 92 | 73 | 90 | 86 | 75 | 1462.21 |
10 GPT-5 (high) | OpenAI | Closed | Established | Reasoning | 128K | N/A | 83 | 36.28s | ~80 | 80 | 71 | 83 | 89 | 73 | 86 | 87 | 72 | 1433.37 |
11 GLM-5.1 | Z.AI | Open | Current | Reasoning | 203K | $1.40 / $4.40 | N/A | N/A | 79 | 78 | 70 | 71 | — | 74 | — | 92 | 90 | 1467.44 |
12 GPT-5.2 | OpenAI | Closed | Current | Reasoning | 400K | $2.00 / $8.00 | 73 | 130.34s | ~79 | 68 | 73 | 82 | 87 | 80 | 92 | 85 | 76 | 1439.54 |
13 Gemini 3 Pro | Google | Closed | Current | Standard | 2M | N/A | 109 | 32.65s | ~78 | 71 | 65 | 75 | 86 | 74 | 86 | 85 | 77 | 1486.16 |
14 GPT-5.2-Codex | OpenAI | Closed | Current | Reasoning | 400K | $2.00 / $8.00 | 123 | 87.34s | ~78 | 80 | 69 | 91 | 88 | 75 | 88 | 92 | 96 | 1331 |
15 Claude Opus 4.5 | Anthropic | Closed | Current | Standard | 200K | N/A | 46 | 1.01s | 77 | 75 | 68 | 68 | 78 | 73 | 87 | 79 | 95 | 1468 |
16 GPT-5.4 Pro | OpenAI | Closed | Current | Reasoning | 1.05M | $30.00 / $180.00 | 74 | 151.79s | 77 | 77 | 61 | 83 | 94 | 49 | — | 83 | 55 | 1483.56 |
17 Qwen3.5 397B (Reasoning) | Alibaba | Open | Current | Reasoning | 128K | $0.00 / $0.00 | N/A | N/A | ~77 | 73 | 67 | 82 | 71 | 72 | 88 | 89 | 93 | 1450 |
18 GPT-5.1 | OpenAI | Closed | Current | Reasoning | 200K | $1.50 / $6.00 | 111 | 57.47s | ~77 | 75 | 70 | 69 | 92 | 74 | 88 | 86 | 71 | 1438.53 |
19 GPT-5.1-Codex-Max | OpenAI | Closed | Current | Reasoning | 400K | $2.00 / $8.00 | N/A | N/A | ~77 | 76 | 70 | 92 | 88 | 74 | 88 | 91 | 96 | 1349 |
20 GLM-5 | Z.AI | Open | Superseded | Standard | 200K | $0.00 / $0.00 | 74 | 1.64s | 76 | 72 | 65 | 71 | 69 | 73 | 83 | 85 | 90 | 1455.57 |
21 Kimi K2.5 (Reasoning) | Moonshot AI | Closed | Current | Reasoning | 128K | N/A | N/A | N/A | ~76 | 67 | 73 | 74 | 78 | 68 | 90 | 94 | 71 | 1447 |
22 Gemma 4 31B | Google | Open | Current | Reasoning | 256K | $0.00 / $0.00 | N/A | N/A | ~73 | — | 78 | 66 | 77 | 61 | — | — | — | 1451.16 |
23 GLM-4.7 | Z.AI | Open | Established | Reasoning | 200K | $0.00 / $0.00 | 82 | 1.10s | ~73 | 62 | 68 | 79 | 71 | 63 | 84 | 85 | 86 | 1442.71 |
24 GPT-5 (medium) | OpenAI | Closed | Established | Reasoning | 128K | N/A | 83 | 36.28s | ~73 | 70 | 69 | 82 | 88 | 71 | 88 | 88 | 93 | 1328 |
25 | xAI | Closed | Current | Standard | 1M | N/A | 138 | 0.54s | ~72 | 65 | 55 | 88 | 87 | 71 | 85 | 85 | 94 | 1420 |
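The cells above mix several notations: `N/A` and a dash for missing values, `K`/`M` suffixes on context windows, `$a / $b` price pairs, and a leading `~` on some scores. A minimal sketch of normalizing them for sorting or filtering follows. The function names are hypothetical, the `K`/`M` suffixes are assumed to be decimal (1,000 and 1,000,000), and the `~` is treated only as a flag because the table does not define it. The two price figures are returned in the order they appear, since the table does not label them.

```python
from typing import Optional, Tuple

# Markers the table uses for missing cells ("\u2014" is the dash character).
MISSING = {"", "N/A", "\u2014"}


def parse_context(cell: str) -> Optional[int]:
    """Convert '400K' or '1.05M' to a token count (assumes decimal K/M)."""
    cell = cell.strip()
    if cell in MISSING:
        return None
    multiplier = {"K": 1_000, "M": 1_000_000}[cell[-1].upper()]
    return round(float(cell[:-1]) * multiplier)


def parse_price(cell: str) -> Optional[Tuple[float, float]]:
    """Split '$15.00 / $75.00' into its two dollar amounts, in order."""
    cell = cell.strip()
    if cell in MISSING:
        return None
    first, second = (float(part.strip().lstrip("$")) for part in cell.split("/"))
    return first, second


def parse_score(cell: str) -> Tuple[Optional[float], bool]:
    """Return (value, has_tilde); the meaning of '~' is not defined in the table."""
    cell = cell.strip()
    if cell in MISSING:
        return None, False
    return float(cell.lstrip("~")), cell.startswith("~")
```

Returning `None` for missing cells, rather than zero, keeps a model with no published price from sorting as if it were free.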