
Artificial Analysis Omniscience Hallucination Rate (AA-Omniscience Hallucination Rate)

A display-only Artificial Analysis factuality metric for the rate of incorrect answers among non-correct responses.

Benchmark score on AA-Omniscience Hallucination Rate — June 13, 2026

BenchLM mirrors the published score view for AA-Omniscience Hallucination Rate. Command A+ leads the public snapshot at 14.1%, followed by MiniMax M3 (16.1%) and Qwen3.7 Max (22.9%). BenchLM does not use these results to rank models overall.

121 models · Knowledge · Current · Display only · Updated June 13, 2026

The published AA-Omniscience Hallucination Rate snapshot is tightly clustered at the top: Command A+ sits at 14.1%, while the third row is only 8.8 points behind. The broader top-10 spread is 17.2 points, so the benchmark still separates strong models even when the leaders cluster.
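The two spread figures quoted above can be reproduced from the top ten published scores. The following is a minimal sketch, not BenchLM's own code: the list is copied from the score table on this page, and the helper name `gap_to_leader` is hypothetical.

```python
# Top-10 published hallucination rates (percent), copied from the score table.
top10 = [14.1, 16.1, 22.9, 24.5, 25.0, 25.5, 28.5, 29.4, 29.9, 31.3]


def gap_to_leader(scores, rank):
    """Percentage-point gap between the leader and the row at 1-based `rank`.

    Hypothetical helper for illustration; rounding to one decimal matches
    the precision of the published percentages.
    """
    return round(scores[rank - 1] - scores[0], 1)


third_row_gap = gap_to_leader(top10, 3)   # gap between rows 1 and 3
top10_spread = gap_to_leader(top10, 10)   # gap between rows 1 and 10

print(third_row_gap, top10_spread)  # prints: 8.8 17.2
```

Because the metric is lower-is-better, a larger gap here means a higher hallucination rate relative to the leader.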

121 models have been evaluated on AA-Omniscience Hallucination Rate. The benchmark falls in the Knowledge category. This category carries a 12% weight in BenchLM.ai's overall scoring system. AA-Omniscience Hallucination Rate is currently displayed for reference but excluded from the scoring formula, so it does not directly affect overall rankings.

About AA-Omniscience Hallucination Rate

Year: 2026

Tasks: Knowledge questions

Format: Hallucination rate

Difficulty: Factuality

BenchLM marks this row lower-is-better because a lower hallucination rate is preferable, even though the OpenRouter card displays the raw percentage.
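As a rough illustration of what lower-is-better ordering means in practice, the sketch below sorts the three leading published scores in ascending order. The function name `rank_models` and its `lower_is_better` flag are assumptions made for this example, not part of any BenchLM API.

```python
def rank_models(scores, lower_is_better=True):
    """Return (model, score) pairs ordered from best to worst.

    Hypothetical helper: for a lower-is-better metric the smallest
    percentage ranks first, so the sort is ascending.
    """
    return sorted(
        scores.items(),
        key=lambda item: item[1],
        reverse=not lower_is_better,
    )


# Scores (percent) for the three leaders named on this page.
published = {"Qwen3.7 Max": 22.9, "Command A+": 14.1, "MiniMax M3": 16.1}

ranking = rank_models(published)
for position, (model, score) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {score}%")
```

Passing `lower_is_better=False` would reverse the order, which is how a raw percentage would be read if it were treated like an accuracy score.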

BenchLM freshness & provenance

Version: AA-Omniscience Hallucination Rate 2026

Refresh cadence: Quarterly

Staleness state: Current

Question availability: Public benchmark set

Current · Display only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (121 models)

Rank  Score
1  14.1%
2  16.1%
3  22.9%
4  24.5%
5  25.0%
6  25.5%
7  28.5%
8  29.4%
9  29.9%
10  31.3%
11  32.0%
12  32.9%
13  34.0%
14  34.4%
15  35.9%
16  36.2%
17  37.9%
18  39.3%
19  40.8%
20  44.2%
21  44.4%
22  47.0%
23  48.3%
24  49.7%
25  49.9%
26  51.0%
27  51.3%
28  51.9%
29  59.8%
30  60.7%
31  60.9%
32  61.3%
33  62.2%
34  64.2%
35  64.6%
36  64.6%
37  65.9%
38  66.0%
39  66.1%
40  66.8%
41  66.8%
42  67.8%
43  67.9%
44  69.3%
46  72.8%
47  73.2%
48  73.6%
49  74.2%
50  74.4%
51  74.4%
52  75.1%
53  75.4%
54  76.0%
55  77.8%
56  77.9%
57  78.2%
58  78.3%
59  78.5%
60  79.6%
61  79.7%
62  79.7%
63  79.8%
64  80.1%
65  80.3%
66  80.4%
67  80.5%
68  80.8%
69  80.9%
70  81.0%
71  81.6%
72  81.6%
73  81.7%
74  81.8%
75  82.0%
76  82.0%
77  82.1%
79  83.4%
80  83.5%
81  83.7%
82  84.0%
83  84.0%
84  84.4%
85  85.5%
86  85.5%
87  86.6%
88  86.6%
89  86.9%
90  86.9%
91  87.1%
92  87.3%
93  87.4%
94  88.6%
95  88.6%
96  89.1%
97  89.1%
98  89.4%
99  89.4%
100  89.5%
101  89.7%
102  89.8%
103  90.2%
104  90.3%
105  90.9%
106  90.9%
107  91.2%
108  91.5%
109  91.5%
110  92.3%
111  93.3%
112  93.5%
113  93.5%
114  93.5%
115  94.0%
116  94.0%
117  94.1%
118  94.4%
119  95.8%
120  95.8%
121  97.0%

FAQ

What does AA-Omniscience Hallucination Rate measure?

A display-only Artificial Analysis factuality metric for the rate of incorrect answers among non-correct responses.
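Read literally, that definition is a ratio: incorrect answers divided by all responses that were not correct. The sketch below works through it under the assumption that non-correct responses also include things like abstentions or partial answers. The function name, parameters, and example counts are hypothetical and are not taken from Artificial Analysis or BenchLM.

```python
def hallucination_rate(total, correct, incorrect):
    """Share of non-correct responses that were incorrect answers.

    Assumption for this sketch: every response is either correct,
    incorrect, or some other non-correct outcome (for example an
    abstention), so non_correct = total - correct.
    """
    non_correct = total - correct
    if incorrect < 0 or correct < 0 or incorrect > non_correct:
        raise ValueError("counts are inconsistent")
    if non_correct == 0:
        return 0.0  # nothing was answered wrongly or declined
    return incorrect / non_correct


# Hypothetical counts: 100 questions, 40 correct, 15 incorrect, 45 other.
rate = hallucination_rate(total=100, correct=40, incorrect=15)
print(f"{rate:.1%}")  # prints: 25.0%
```

Under this reading, a model that declines to answer when unsure lowers its rate, while a model that guesses wrongly raises it, even if both answer the same number of questions correctly.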

Which model scores highest on AA-Omniscience Hallucination Rate?

Command A+ by Cohere currently leads AA-Omniscience Hallucination Rate with the lowest published rate, 14.1%. Lower is better on this metric.

How many models are evaluated on AA-Omniscience Hallucination Rate?

121 AI models have been evaluated on AA-Omniscience Hallucination Rate on BenchLM.

Last updated: June 13, 2026 · BenchLM version AA-Omniscience Hallucination Rate 2026
