Benchmark profile

Artificial Analysis Intelligence Index

A display-only intelligence index published by Artificial Analysis that aggregates provider-reported and benchmark-derived signals into a single model-level score.

Data verified August 2, 2026 · 20 confirmed releases in the last 30 days

The public Artificial Analysis Intelligence Index snapshot ranks Claude Opus 5 first at 60.7%, ahead of Claude Fable 5 (59.9%) and GPT-5.6 Sol (58.9%) among 167 tested models. We mirror the table as display-only evidence; it does not affect overall rankings.

Benchmark score on Artificial Analysis Intelligence Index — August 2, 2026

BenchLM mirrors the published score view for Artificial Analysis Intelligence Index. Claude Opus 5 leads the public snapshot at 60.7%, followed by Claude Fable 5 (59.9%) and GPT-5.6 Sol (58.9%). BenchLM does not use these results to rank models overall.

1 · Claude Opus 5 · Anthropic · Closed · claude-opus-5 · 60.7% · Overall 82.79
2 · Claude Fable 5 · Anthropic · Closed · claude-fable-5 · 59.9% · Overall 82.73 · Context 1M+
3 · GPT-5.6 Sol · OpenAI · Closed · gpt-5-6-sol · 58.9% · Overall 81.36 · Context 1.05M

167 models · Knowledge · Current · Display only · Updated August 2, 2026

Benchmark score table (167 models)

Rank · Model · Provider · License · Score
1 · Claude Opus 5 · Anthropic · Closed · 60.7%
2 · Claude Fable 5 · Anthropic · Closed · 59.9%
3 · GPT-5.6 Sol · OpenAI · Closed · 58.9%
4 · Kimi K3 · Moonshot AI · Closed · 57.1%
5 · Claude Opus 4.8 · Anthropic · Closed · 55.7%
6 · GPT-5.6 Terra · OpenAI · Closed · 55.0%
7 · GPT-5.5 · OpenAI · Closed · 54.8%
8 · Grok 4.5 · xAI · Closed · 53.8%
9 · Claude Opus 4.7 (Adaptive) · Anthropic · Closed · 53.5%
10 · Claude Sonnet 5 · Anthropic · Closed · 53.4%
11 · GPT-5.4 · OpenAI · Closed · 51.4%
12 · GPT-5.6 Luna · OpenAI · Closed · 51.2%
13 · GLM-5.2 · Z.AI · Open weight · 51.1%
14 · Muse Spark 1.1 · Meta · Closed · 50.6%
15 · Gemini 3.5 Flash · Google · Closed · 50.2%
16 · Gemini 3.6 Flash · Google · Closed · 50.1%
17 · Gemini 3.1 Pro · Google · Closed · 46.5%
18 · Qwen3.7 Max · Alibaba · Closed · 46.0%
19 · MiniMax M3 · MiniMax · Open weight · 44.4%
20 · DeepSeek V4 Pro (Max) · DeepSeek · Open weight · 44.3%
21 · GPT-5.3 Codex · OpenAI · Closed · 44.3%
22 · GPT-5.3-Codex-Spark · OpenAI · Closed · 44.3%
23 · Kimi K2.6 · Moonshot AI · Open weight · 44.2%
24 · Claude Opus 4.6 (Adaptive) · Anthropic · Closed · 43.7%
25 · DeepSeek V4 Pro (High) · DeepSeek · Open weight · 43.1%
26 · Muse Spark · Meta · Closed · 43.1%
27 · Claude Opus 4.7 · Anthropic · Closed · 42.7%
28 · MiMo-V2.5-Pro · Xiaomi · Closed · 42.2%
29 · GPT-5.2 · OpenAI · Closed · 42.2%
30 · Kimi K2.7 Code · Moonshot AI · Open weight · 42.0%
31 · Hy3 Preview · Tencent · Open weight · 41.2%
32 · Hy3 · Tencent · Open weight · 41.2%
33 · Claude Opus 4.5 Thinking · Anthropic · Closed · 40.8%
34 · Inkling · Thinking Machines Lab · Open weight · 40.7%
35 · MiMo-V2-Pro · Xiaomi · Closed · 40.3%
36 · DeepSeek V4 Flash (Max) · DeepSeek · Open weight · 40.3%
37 · GLM-5.1 · Z.AI · Open weight · 40.2%
38 · GPT-5.2-Codex · OpenAI · Closed · 40.1%
39 · Qwen 3.6 Max (preview) · Alibaba · Closed · 40.0%
40 · GPT-5.4 mini · OpenAI · Closed · 40.0%
41 · Qwen3.6 Plus · Alibaba · Closed · 39.6%
42 · Gemini 3 Pro · Google · Closed · 39.5%
43 · GLM-5 · Z.AI · Open weight · 39.5%
44 · Qwen3.7 Plus · Alibaba · Closed · 39.0%
45 · GPT-5.4 nano · OpenAI · Closed · 38.2%
46 · MiniMax M2.7 · MiniMax · Open weight · 38.1%
47 · GLM-5-Turbo · Z.AI · Closed · 38.1%
48 · Claude Opus 4.6 · Anthropic · Closed · 37.8%
49 · Nemotron 3 Ultra · NVIDIA · Open weight · 37.8%
50 · Grok 4.3 · xAI · Closed · 37.6%
51 · DeepSeek V4 Flash (High) · DeepSeek · Open weight · 37.5%
52 · Qwen3.6-27B · Alibaba · Open weight · 37.0%
53 · GPT-5.1 · OpenAI · Closed · 36.9%
54 · Gemini 3.5 Flash-Lite · Google · Closed · 36.5%
55 · Claude Sonnet 4.6 · Anthropic · Closed · 35.9%
56 · Kimi K2.5 · Moonshot AI · Open weight · 35.4%
57 · Kimi K2.5 (Reasoning) · Moonshot AI · Closed · 35.4%
58 · MiMo-V2-Omni · Xiaomi · Closed · 35.0%
59 · GPT-5.1-Codex-Max · OpenAI · Closed · 34.7%
60 · GPT-5.1-Codex · OpenAI · Closed · 34.7%
61 · Claude Opus 4.5 · Anthropic · Closed · 34.7%
62 · GPT-5 (high) · OpenAI · Closed · 34.7%
63 · GLM-5V-Turbo · Z.AI · Closed · 34.5%
64 · Qwen3.5-27B · Alibaba · Open weight · 33.8%
65 · GPT-5 (medium) · OpenAI · Closed · 33.7%
66 · Claude 4.1 Opus Thinking · Anthropic · Closed · 33.7%
67 · GLM-4.7 · Z.AI · Open weight · 33.7%
68 · Qwen3.5 397B · Alibaba · Open weight · 33.7%
69 · Qwen3.5 397B (Reasoning) · Alibaba · Open weight · 33.7%
70 · MiniMax M2.5 · MiniMax · Closed · 33.6%
71 · Grok 4 · xAI · Closed · 33.3%
72 · o3-pro · OpenAI · Closed · 32.5%
73 · Qwen3.5-122B-A10B · Alibaba · Open weight · 32.3%
74 · Qwen3.6-35B-A3B · Alibaba · Open weight · 31.6%
75 · Grok 4.1 Fast (Reasoning) · xAI · Closed · 30.6%
76 · o3 · OpenAI · Closed · 30.4%
77 · Step 3.7 Flash · StepFun · Open weight · 30.3%
78 · Mistral Medium 3.5 128B · Mistral · Open weight · 29.9%
79 · Gemma 4 31B · Google · Open weight · 29.4%
80 · Qwen3.5-35B-A3B · Alibaba · Open weight · 29.3%
81 · Claude 4.1 Opus · Anthropic · Closed · 28.2%
82 · Gemini 3 Flash · Google · Closed · 27.4%
83 · Grok 4 Fast (Reasoning) · xAI · Closed · 27.4%
84 · Step 3.5 Flash · StepFun · Open weight · 26.0%
85 · Gemini 2.5 Pro · Google · Closed · 25.8%
86 · Gemma 4 26B A4B · Google · Open weight · 25.7%
87 · Claude 4 Sonnet · Anthropic · Closed · 25.5%
88 · Nemotron 3 Super 120B A12B · NVIDIA · Open weight · 25.4%
89 · GPT-5 mini · OpenAI · Closed · 25.3%
90 · Gemini 3.1 Flash-Lite · Google · Closed · 25.0%
91 · MiMo-V2-Flash · Xiaomi · Open weight · 24.7%
92 · DeepSeek V3.2 · DeepSeek · Open weight · 24.7%
93 · Qwen3 Max · Alibaba · Closed · 24.0%
94 · GPT-OSS 120B · OpenAI · Open weight · 23.8%
95 · o1 · OpenAI · Closed · 23.4%
96 · GLM-4.6 · Z.AI · Open weight · 23.0%
97 · GLM-4.7-Flash · Z.AI · Open weight · 22.9%
98 · Command A+ · Cohere · Open weight · 22.5%
99 · K-Exaone · LG AI Research · Closed · 22.1%
100 · Gemma 4 12B · Google · Open weight · 21.8%
101 · Grok Code Fast 1 · xAI · Closed · 21.6%
102 · Mercury 2 · Inception · Closed · 21.4%
103 · DeepSeek V3.1 · DeepSeek · Open weight · 21.1%
104 · DeepSeek V3.1 (Reasoning) · DeepSeek · Open weight · 20.7%
105 · DeepSeek-R1 · DeepSeek · Open weight · 20.1%
106 · GPT-5 nano · OpenAI · Closed · 19.9%
107 · Mistral Small 4 · Mistral · Open weight · 19.6%
108 · Mistral Small 4 (Reasoning) · Mistral · Open weight · 19.6%
109 · Kimi K2 · Moonshot AI · Closed · 19.4%
110 · GPT-4.1 · OpenAI · Closed · 19.4%
111 · o3-mini · OpenAI · Closed · 19.0%
112 · o1-pro · OpenAI · Closed · 18.9%
113 · Trinity-Large-Preview · Arcee AI · Open weight · 18.2%
114 · Trinity-Large-Thinking · Arcee AI · Open weight · 18.2%
115 · MiniMax M1 80k · MiniMax · Closed · 17.7%
116 · o1-preview · OpenAI · Closed · 17.0%
117 · Grok 4.1 Fast · xAI · Closed · 16.9%
118 · GLM-4.5-Air · Z.AI · Closed · 16.5%
119 · Mistral Large 3 · Mistral · Closed · 15.9%
120 · Nemotron 3 Nano Omni 30B A3B · NVIDIA · Open weight · 14.9%
121 · GPT-OSS 20B · OpenAI · Open weight · 14.9%
122 · GPT-4.1 mini · OpenAI · Closed · 14.8%
123 · Llama 4 Maverick · Meta · Open weight · 14.3%
124 · Nemotron 3 Nano 30B · NVIDIA · Open weight · 14.2%
125 · DeepSeek V3 · DeepSeek · Open weight · 14.2%
126 · Gemini 2.5 Flash · Google · Closed · 14.1%
127 · Ling 2.6 Flash · InclusionAI · Open weight · 14.1%
128 · Mistral Medium 3 · Mistral · Closed · 12.5%
129 · Sarvam 105B · Sarvam · Open weight · 11.9%
130 · Gemma 4 E4B · Google · Open weight · 11.9%
131 · Claude 3 Opus · Anthropic · Closed · 11.8%
132 · GPT-4o · OpenAI · Closed · 11.2%
133 · Ministral 3 14B (Reasoning) · Mistral · Open weight · 11.1%
134 · Ministral 3 14B · Mistral · Open weight · 11.1%
135 · DeepSeek R1 Distill Qwen 32B · DeepSeek · Open weight · 11.0%
136 · Llama 4 Scout · Meta · Open weight · 10.0%
137 · Gemini 1.5 Pro · Google · Closed · 10.0%
138 · GPT-4.1 nano · OpenAI · Closed · 9.6%
139 · Gemma 4 E2B · Google · Open weight · 9.5%
140 · Mistral Large 2 · Mistral · Closed · 9.2%
141 · Nemotron Ultra 253B · NVIDIA · Open weight · 9.1%
142 · Ministral 3 8B (Reasoning) · Mistral · Open weight · 9.0%
143 · Ministral 3 8B · Mistral · Open weight · 9.0%
144 · Llama 3.1 405B · Meta · Open weight · 8.5%
145 · LFM2.5-8B-A1B · LiquidAI · Open weight · 8.3%
146 · GPT-4 Turbo · OpenAI · Closed · 7.9%
147 · Solar Pro 2 · Upstage · Closed · 7.8%
148 · Nova Pro · Amazon · Closed · 7.7%
149 · Gemma 3 27B · Google · Open weight · 7.4%
150 · Qwen2.5 Coder 32B Instruct · Alibaba · Open weight · 7.1%
151 · GPT-4o mini · OpenAI · Closed · 6.9%
152 · Sarvam 30B · Sarvam · Open weight · 6.6%
153 · Ministral 3 3B (Reasoning) · Mistral · Open weight · 6.5%
154 · Ministral 3 3B · Mistral · Open weight · 6.5%
155 · Exaone 4.0 32B · LG AI Research · Open weight · 6.0%
156 · LFM2-24B-A2B · LiquidAI · Closed · 5.0%
157 · Phi-4 · Microsoft · Open weight · 4.9%
158 · Claude 3 Haiku · Anthropic · Closed · 3.9%
159 · Gemini 1.0 Pro · Google · Closed · 3.1%
160 · Exaone 4.0 1.2B · LG AI Research · Open weight · 2.8%
161 · LFM2.5-1.2B-Thinking · LiquidAI · Closed · 2.8%
162 · LFM2.5-1.2B-Instruct · LiquidAI · Closed · 2.7%
163 · Granite-4.0-H-1B · IBM · Open weight · 2.7%
164 · Granite-4.0-1B · IBM · Open weight · 2.1%
165 · LFM2.5-VL-1.6B-Extract · LiquidAI · Open weight · 1.0%
166 · Granite-4.0-350M · IBM · Open weight · 1.0%
167 · Granite-4.0-H-350M · IBM · Open weight · 1.0%

The published Artificial Analysis Intelligence Index snapshot places Claude Opus 5 first at 60.7%. Third-ranked GPT-5.6 Sol trails by 1.8 points, and the top 10 span 7.3 points (from 60.7% down to 53.4% for Claude Sonnet 5), so the leading results sit in a relatively narrow band.
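The two gaps quoted above can be re-derived from the score table. The snippet below is a minimal sketch: the scores are copied from the top 10 rows of the table, while the variable names are illustrative only. Rounding to one decimal guards against floating-point noise in the subtraction.

```python
# Top-10 scores (percent), copied from the score table above.
top10 = [60.7, 59.9, 58.9, 57.1, 55.7, 55.0, 54.8, 53.8, 53.5, 53.4]

# Gap between first place and third place.
gap_to_third = round(top10[0] - top10[2], 1)

# Spread across the whole top 10 (highest minus lowest).
top10_range = round(max(top10) - min(top10), 1)

print(f"Gap to third: {gap_to_third} points")   # Gap to third: 1.8 points
print(f"Top-10 range: {top10_range} points")    # Top-10 range: 7.3 points
```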

167 models have been evaluated on Artificial Analysis Intelligence Index. The benchmark falls in the Knowledge category. This category carries a 12% weight in BenchLM.ai's overall scoring system. Artificial Analysis Intelligence Index is currently displayed for reference but excluded from the scoring formula, so it does not directly affect overall rankings.
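To illustrate what "displayed for reference but excluded from the scoring formula" could look like in practice, here is a minimal sketch. Only the 12% Knowledge weight and the 60.7% score come from this page. The `BenchmarkResult` type, the `knowledge_contribution` helper, the two sample benchmarks with their scores, and the use of a simple mean within the category are all hypothetical; the actual formula is described on the BenchLM methodology page, not here.

```python
from dataclasses import dataclass

# From this page: the Knowledge category carries a 12% weight.
KNOWLEDGE_WEIGHT = 0.12


@dataclass(frozen=True)
class BenchmarkResult:
    """Hypothetical record for one benchmark score (0-100 scale)."""
    name: str
    score: float
    display_only: bool = False


def knowledge_contribution(results: list[BenchmarkResult]) -> float:
    """Weighted Knowledge contribution, skipping display-only rows.

    Assumption: scored benchmarks in the category are combined with a
    simple mean. The real aggregation may differ.
    """
    scored = [r.score for r in results if not r.display_only]
    if not scored:
        return 0.0
    return KNOWLEDGE_WEIGHT * sum(scored) / len(scored)


# Hypothetical inputs: two made-up scored benchmarks plus the
# display-only Artificial Analysis row for Claude Opus 5 (60.7%).
scored_rows = [
    BenchmarkResult("example-knowledge-a", 80.0),
    BenchmarkResult("example-knowledge-b", 90.0),
]
aa_row = BenchmarkResult(
    "Artificial Analysis Intelligence Index", 60.7, display_only=True
)

without_aa = knowledge_contribution(scored_rows)
with_aa = knowledge_contribution(scored_rows + [aa_row])

# Both values are identical: the display-only row is filtered out
# before averaging, so it cannot move the weighted contribution.
print(without_aa, with_aa)
```

Filtering on a flag rather than deleting the row keeps the score available for display while making it inert in the weighted total, which matches the behavior the page describes.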

About Artificial Analysis Intelligence Index

Year: 2026
Tasks: Cross-benchmark intelligence index
Format: Aggregated model score
Difficulty: Display-only external reference

BenchLM tracks Artificial Analysis as a display-only external reference rather than a weighted benchmark. It is useful as a market snapshot, but it is not a benchmark-native row with a single public task set, scoring harness, or exact-source methodology aligned to BenchLM's core benchmark pages.

Artificial Analysis

BenchLM freshness & provenance

Version: Artificial Analysis Intelligence Index 2026
Refresh cadence: Quarterly
Staleness state: Current
Question availability: Public benchmark set

Current · Display only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

FAQ

What does Artificial Analysis Intelligence Index measure?

A display-only intelligence index published by Artificial Analysis that aggregates provider-reported and benchmark-derived signals into a single model-level score.

Which model scores highest on Artificial Analysis Intelligence Index?

Claude Opus 5 by Anthropic currently leads with a score of 60.7% on Artificial Analysis Intelligence Index.

How many models are evaluated on Artificial Analysis Intelligence Index?

BenchLM lists 167 AI models evaluated on Artificial Analysis Intelligence Index.

Compare Top Models on Artificial Analysis Intelligence Index

Claude Opus 5 vs Claude Fable 5 · Claude Fable 5 vs GPT-5.6 Sol · GPT-5.6 Sol vs Kimi K3 · Kimi K3 vs Claude Opus 4.8

Last updated: August 2, 2026 · BenchLM version Artificial Analysis Intelligence Index 2026
