| 1 | Gemini 3.1 Pro | Google | Closed | 1M | 83 | 76 | 72 | 88 | 95 | 81 | 94 | 95 | 97 | 1423 |
| 2 | GPT-5.4 | OpenAI | Closed | 1.05M | 79 | 77 | 73 | 90 | 88 | 83 | 95 | 96 | 98 | 1454 |
| 3 | Claude Sonnet 4.6 | Anthropic | Closed | 200K | 76 | 71 | 61 | 78 | 92 | 74 | 90 | 91 | 96 | 1339 |
| 4 | GPT-5.4 Pro | OpenAI | Closed | 1.05M | 76 | 88 | 87 | 96 | 95 | 85 | 96 | 97 | 98 | 1472 |
| 5 | Gemini 3 Pro Deep Think | Google | Closed | 2M | 70 | 78 | 60 | 82 | 95 | 76 | 87 | 89 | 96 | 1349 |
| 6 | GPT-5.3 Codex | OpenAI | Closed | 400K | 70 | 76 | 73 | 93 | 91 | 82 | 93 | 93 | 98 | 1416 |
| 7 | o3-mini | OpenAI | Closed | 200K | 70 | 67 | 55 | 81 | 74 | 71 | 73 | 94 | — | — |
| 8 | Qwen2.5-1M | Alibaba | Open | 1M | 67 | 65 | 45 | 81 | 68 | 62 | 80 | 84 | 85 | 1256 |
| 9 | GPT-4.1 | OpenAI | Closed | 1M | 67 | 65 | 52 | 81 | 74 | 63 | 69 | 87 | — | — |
| 10 | o1 | OpenAI | Closed | 200K | 67 | 65 | 48 | 78 | 71 | 69 | 77 | 92 | — | — |
| 11 | Grok 4.1 | xAI | Closed | 1M | 67 | 78 | 74 | 91 | 93 | 81 | 93 | 93 | 97 | 1435 |
| 12 | Claude Opus 4.6 | Anthropic | Closed | 1M | 67 | 79 | 76 | 86 | 85 | 78 | 95 | 95 | 97 | 1422 |
| 13 | DeepSeek V3.2 (Thinking) | DeepSeek | Open | 128K | 66 | 69 | 51 | 60 | 71 | 66 | 81 | 85 | 86 | 1260 |
| 14 | DeepSeek Coder 2.0 | DeepSeek | Open | 128K | 66 | 68 | 53 | 73 | 59 | 61 | 80 | 86 | 81 | 1238 |
| 15 | GPT-5.2 | OpenAI | Closed | 400K | 66 | 66 | 69 | 82 | 95 | 80 | 92 | 94 | 97 | 1426 |
| 16 | Nemotron 3 Ultra 500B | NVIDIA | Open | 10M | 63 | 63 | 44 | 79 | 67 | 58 | 80 | 84 | 77 | 1252 |
| 17 | Grok 4 | xAI | Closed | 128K | 63 | 58 | 43 | 60 | 78 | 65 | 81 | 82 | 86 | 1238 |
| 18 | Qwen3.5 397B (Reasoning) | Alibaba | Open | 128K | 63 | 75 | 62 | 82 | 71 | 72 | 88 | 89 | 93 | 1326 |
| 19 | Claude Haiku 4.5 | Anthropic | Closed | 200K | 62 | 57 | 42 | 69 | 78 | 54 | 80 | 86 | 71 | 1263 |
| 20 | o3 | OpenAI | Closed | 200K | 62 | 70 | 49 | 62 | 72 | 67 | 81 | 85 | 88 | 1258 |
| 21 | Gemini 3 Flash | Google | Closed | 1M | 62 | 58 | 41 | 73 | 80 | 54 | 81 | 85 | 73 | 1241 |
| 22 | Claude 4 Sonnet | Anthropic | Closed | 200K | 61 | 58 | 43 | 71 | 80 | 58 | 82 | 83 | 75 | 1239 |
| 23 | Nemotron 3 Super 100B | NVIDIA | Open | 1M | 60 | 57 | 42 | 71 | 60 | 53 | 80 | 84 | 70 | 1260 |
| 24 | Qwen3.5 397B | Alibaba | Open | 128K | 60 | 57 | 41 | 73 | 61 | 61 | 79 | 82 | 83 | 1237 |
| 25 | Claude 3.5 Sonnet | Anthropic | Closed | 200K | 60 | 55 | 38 | 68 | 75 | 52 | 81 | 83 | 69 | 1214 |