Top AI models ranked by reasoning benchmark performance including SimpleQA and MuSR.
94
avg
93
92
91
90
89
88
87
86
85
83
82
81
80
79
77
76
75
73
72
70
69
67
66
65
64
63
62
61
60
59
58
57
55
53
51
50
49
48
47
46
45
44
43
42
41
40
39
38
37
36
35
34
33
32
31
30
29
28
27
26