1 GPT-5.4 Pro OpenAI | Closed | 1.05M | 87 | 88 | 87 | 96 | 95 | 85 | 96 | 97 | 98 | 1472 |
2 GPT-5.4 OpenAI | Closed | 1.05M | 84 | 77 | 74 | 90 | 88 | 83 | 95 | 96 | 98 | 1480 |
3 Gemini 3.1 Pro Google | Closed | 1M | 83 | 76 | 69 | 88 | 95 | 81 | 94 | 95 | 97 | 1500 |
4 Claude Opus 4.6 Anthropic | Closed | 1M | 80 | 73 | 73 | 82 | 85 | 78 | 95 | 95 | 97 | 1504 |
5 GPT-5.3 Codex OpenAI | Closed | 400K | 80 | 76 | 69 | 93 | 91 | 82 | 93 | 93 | 98 | 1416 |
6 Gemini 3 Pro Deep Think Google | Closed | 2M | 79 | 78 | 60 | 82 | 95 | 76 | 87 | 89 | 96 | 1349 |
7 GPT-5.2 OpenAI | Closed | 400K | 77 | 66 | 70 | 82 | 95 | 80 | 92 | 94 | 97 | 1481 |
8 Claude Sonnet 4.6 Anthropic | Closed | 200K | 76 | 68 | 63 | 78 | 92 | 73 | 90 | 90 | 97 | 1438 |
9 Qwen3.5 397B (Reasoning) Alibaba | Open | 128K | 72 | 75 | 61 | 82 | 71 | 72 | 88 | 89 | 93 | 1450 |
10 Kimi K2.5 (Reasoning) Moonshot AI | Closed | 128K | 71 | 58 | 70 | 74 | 78 | 68 | 90 | 94 | 94 | 1447 |
11 GLM-5 (Reasoning) Zhipu AI | Open | 200K | 71 | 78 | 62 | 87 | 79 | 74 | 86 | 92 | 96 | 1451 |
12 o3-mini OpenAI | Closed | 200K | 70 | 67 | 54 | 81 | 74 | 71 | 73 | 94 | — | — |
13 GLM-4.7 Zhipu AI | Open | 200K | 69 | 51 | 69 | 79 | 71 | 63 | 84 | 88 | 89 | 1445 |
14 o3 OpenAI | Closed | 200K | 68 | 70 | 54 | 62 | 72 | 67 | 81 | 85 | 88 | 1258 |
15 Qwen2.5-1M Alibaba | Open | 1M | 67 | 65 | 45 | 81 | 68 | 62 | 80 | 84 | 85 | 1256 |
16 Grok 4 xAI | Closed | 128K | 67 | 58 | 66 | 60 | 78 | 65 | 81 | 82 | 86 | 1238 |
17 GPT-4.1 OpenAI | Closed | 1M | 67 | 65 | 52 | 81 | 74 | 63 | 69 | 87 | — | — |
18 o1 OpenAI | Closed | 200K | 67 | 65 | 47 | 78 | 71 | 69 | 77 | 92 | — | — |
19 DeepSeek V3.2 (Thinking) DeepSeek | Open | 128K | 66 | 69 | 51 | 60 | 71 | 66 | 81 | 85 | 86 | 1421 |
20 DeepSeek Coder 2.0 DeepSeek | Open | 128K | 66 | 68 | 53 | 73 | 59 | 61 | 80 | 86 | 81 | 1238 |
21 Nemotron 3 Ultra 500B NVIDIA | Open | 10M | 65 | 63 | 44 | 79 | 67 | 58 | 80 | 84 | 77 | 1252 |
22 Claude 4 Sonnet Anthropic | Closed | 200K | 65 | 58 | 49 | 71 | 80 | 58 | 82 | 83 | 75 | 1239 |
23 Kimi K2 Moonshot AI | Closed | 128K | 65 | — | 58 | — | — | 64 | — | 90 | 68 | 1051 |
24 Gemini 3 Flash Google | Closed | 1M | 64 | 58 | 45 | 73 | 80 | 54 | 81 | 85 | 73 | 1473 |
25 GLM-5 Zhipu AI | Open | 200K | 63 | 58 | 58 | 77 | 69 | 70 | 82 | 85 | 92 | 1420 |