The 2024 edition of AIME, maintaining the same format of 15 challenging mathematics problems with integer answers from 000 to 999.
BenchLM mirrors the published score view for AIME 2024. o3-mini leads the public snapshot at 87.3%. BenchLM does not use these results to rank models overall.
Year
2024
Tasks
15 problems
Format
Integer answers 000-999
Difficulty
High school olympiad level
AIME 2024 continues the tradition of challenging mathematical reasoning problems. These problems test deep understanding of mathematical concepts and creative problem-solving abilities.
Version
AIME 2024 2024
Refresh cadence
Annual
Staleness state
Refreshing
Question availability
Public benchmark set
BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.
The 2024 edition of AIME, maintaining the same format of 15 challenging mathematics problems with integer answers from 000 to 999.
o3-mini by OpenAI currently leads with a score of 87.3% on AIME 2024.
1 AI models have been evaluated on AIME 2024 on BenchLM.
For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.
Free. No spam. Unsubscribe anytime.