Artificial Analysis Omniscience Index (AA-Omniscience Index)

A display-only Artificial Analysis factual knowledge index.

Benchmark score on AA-Omniscience Index — July 4, 2026

BenchLM mirrors the published score view for AA-Omniscience Index. Gemini 3.1 Pro leads the public snapshot at 32.9%, followed by Claude Opus 4.8 (27.4%) and Claude Opus 4.7 (Adaptive) (26.2%). BenchLM does not use these results to rank models overall.

123 models · Knowledge · Current · Display only · Updated July 4, 2026

The published AA-Omniscience Index snapshot is tightly clustered at the top: Gemini 3.1 Pro sits at 32.9%, and the third-ranked model, Claude Opus 4.7 (Adaptive), is only 6.7 points behind at 26.2%. The broader top-10 spread is 19.4 points (32.9% down to 13.5%), so the benchmark still separates strong models even when the leaders cluster.
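The gap figures quoted above can be reproduced directly from the score table. The following is a minimal sketch, not BenchLM's own tooling: the list is copied by hand from the top ten rows of the table on this page, and the helper name `gap` is hypothetical.

```python
# Top-10 published scores, copied from the snapshot table (percentage points).
top10 = [32.9, 27.4, 26.2, 22.7, 20.1, 18.3, 15.8, 14.2, 14.1, 13.5]


def gap(scores, rank_a, rank_b):
    """Point gap between two 1-indexed ranks, rounded to one decimal place."""
    return round(scores[rank_a - 1] - scores[rank_b - 1], 1)


top3_gap = gap(top10, 1, 3)       # leader vs. third-ranked model
top10_spread = gap(top10, 1, 10)  # leader vs. tenth-ranked model

print(f"Top-3 gap: {top3_gap} points")
print(f"Top-10 spread: {top10_spread} points")
```

Rounding to one decimal keeps the result aligned with the precision of the published scores and avoids floating-point noise in the subtraction.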

123 models have been evaluated on AA-Omniscience Index, which falls in the Knowledge category. That category carries a 12% weight in BenchLM.ai's overall scoring system, but AA-Omniscience Index itself is displayed for reference only and is excluded from the scoring formula, so it does not directly affect overall rankings.

About AA-Omniscience Index

Year: 2026

Tasks: Knowledge questions

Format: Index score

Difficulty: Broad factual knowledge

BenchLM stores the AA-Omniscience Index as a display-only factuality signal alongside the accuracy and hallucination-rate rows.
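This page does not state how the index is computed, but most scores in the table are negative, which a plain accuracy percentage cannot be. One plausible reading, offered here only as an assumption, is a net score in which correct answers add to the total, incorrect answers subtract from it, and abstentions count as zero. The function name and the example counts below are hypothetical and are not taken from BenchLM or Artificial Analysis data.

```python
def net_knowledge_index(correct: int, incorrect: int, abstained: int) -> float:
    """Assumed net-score formula: 100 * (correct - incorrect) / total.

    This is an illustrative assumption, not the confirmed AA-Omniscience
    definition. Under it, the result ranges from -100 to 100.
    """
    total = correct + incorrect + abstained
    if total == 0:
        raise ValueError("at least one question is required")
    return 100.0 * (correct - incorrect) / total


# Hypothetical counts: a model that guesses wrongly more often than it
# answers correctly ends up below zero, even with non-trivial accuracy.
cautious = net_knowledge_index(correct=50, incorrect=30, abstained=20)
overconfident = net_knowledge_index(correct=40, incorrect=60, abstained=0)

print(cautious, overconfident)
```

Under this assumed formula, a model can lower its error penalty by abstaining when unsure, which is why such an index would be read together with separate accuracy and hallucination-rate figures.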

BenchLM freshness & provenance

Version: AA-Omniscience Index 2026

Refresh cadence: Quarterly

Staleness state: Current

Question availability: Public benchmark set

Current · Display only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (123 models)

Rank  Score
1     32.9%
2     27.4%
3     26.2%
4     22.7%
5     20.1%
6     18.3%
7     15.8%
8     14.2%
9     14.1%
10    13.5%
11    13.3%
12    10.2%
13    9.9%
14    6.4%
15    5.7%
16    5.6%
17    4.9%
18    4.1%
19    4.0%
20    3.8%
21    3.6%
22    3.5%
23    2.7%
24    2.4%
25    2.0%
26    1.9%
27    1.4%
28    0.7%
29    -0.8%
30    -1.0%
31    -2.5%
32    -2.9%
33    -3.6%
34    -3.9%
35    -4.0%
36    -6.0%
37    -6.0%
38    -8.1%
39    -8.1%
40    -8.1%
41    -9.2%
42    -9.7%
43    -10.0%
44    -10.1%
45    -10.5%
46    -10.7%
47    -10.7%
48    -14.3%
49    -15.1%
50    -15.3%
51    -15.5%
52    -17.3%
53    -17.4%
54    -18.7%
55    -19.0%
56    -19.8%
57    -20.0%
58    -21.4%
59    -22.3%
60    -22.9%
61    -24.0%
62    -27.1%
63    -27.5%
64    -28.4%
65    -28.4%
66    -28.7%
67    -29.5%
68    -29.8%
69    -29.9%
70    -29.9%
71    -31.5%
72    -31.6%
73    -33.3%
74    -34.0%
75    -34.6%
76    -34.6%
77    -36.0%
78    -36.1%
79    -36.2%
80    -36.3%
81    -37.5%
82    -39.4%
83    -39.6%
84    -41.1%
85    -41.3%
86    -41.8%
87    -42.0%
88    -42.0%
89    -43.1%
90    -44.2%
91    -44.2%
92    -45.4%
93    -45.5%
94    -46.4%
95    -46.7%
96    -47.6%
97    -47.6%
98    -48.1%
99    -48.5%
100   -50.0%
101   -50.1%
102   -50.9%
103   -51.9%
104   -52.4%
105   -56.0%
106   -56.4%
107   -56.7%
108   -57.9%
109   -59.5%
110   -61.7%
111   -62.3%
112   -62.5%
113   -63.9%
114   -65.7%
115   -65.9%
116   -69.2%
117   -72.0%
118   -72.1%
119   -73.6%
120   -81.8%
121   -82.6%
122   -83.9%
123   -87.2%

FAQ

What does AA-Omniscience Index measure?

A display-only Artificial Analysis factual knowledge index.

Which model scores highest on AA-Omniscience Index?

Gemini 3.1 Pro by Google currently leads with a score of 32.9% on AA-Omniscience Index.

How many models are evaluated on AA-Omniscience Index?

123 AI models have been evaluated on AA-Omniscience Index on BenchLM.

Last updated: July 4, 2026 · BenchLM version AA-Omniscience Index 2026