Claude 3 Haiku Benchmark Scores & Performance

Benchmark analysis of Claude 3 Haiku by Anthropic across 14 tests.

Creator: Anthropic
Source Type: Proprietary
Reasoning: Non-Reasoning
Context Window: 200K
Overall Score: 46 (#62 of 88)

Knowledge Benchmarks

MMLU: 56
GPQA: 56
SuperGPQA: 54
OpenBookQA: 52

Coding Benchmarks

HumanEval: 48

Mathematics Benchmarks

AIME 2023: 56
AIME 2024: 58
AIME 2025: 57
HMMT Feb 2023: 52
HMMT Feb 2024: 54
HMMT Feb 2025: 53
BRUMO 2025: 55

Reasoning Benchmarks

SimpleQA: 54
MuSR: 52

Frequently Asked Questions

How does Claude 3 Haiku perform overall in AI benchmarks?

Claude 3 Haiku ranks #62 out of 88 models with an overall score of 46. It was developed by Anthropic and has a 200K-token context window.

Is Claude 3 Haiku good for knowledge and understanding?

Claude 3 Haiku ranks #62 out of 88 models in knowledge and understanding benchmarks, with an average score of 54.5 across MMLU, GPQA, SuperGPQA, and OpenBookQA. That places it in the lower third of the ranking, so there are stronger options in this category.

Is Claude 3 Haiku good for coding and programming?

Claude 3 Haiku ranks #62 out of 88 models in coding and programming benchmarks, with a score of 48 on HumanEval, the only coding benchmark listed. That places it in the lower third of the ranking, so there are stronger options in this category.

Is Claude 3 Haiku good for mathematics?

Claude 3 Haiku ranks #62 out of 88 models in mathematics benchmarks, with an average score of 55 across the seven AIME, HMMT, and BRUMO tests. That places it in the lower third of the ranking, so there are stronger options in this category.

Is Claude 3 Haiku good for reasoning and logic?

Claude 3 Haiku ranks #61 out of 88 models in reasoning and logic benchmarks, with an average score of 53 across SimpleQA and MuSR. That places it in the lower third of the ranking, so there are stronger options in this category.
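
The category averages quoted in the answers above can be checked against the individual scores listed on this page. The sketch below assumes each average is a plain unweighted mean of the benchmarks in its category; the page does not state the method, and the names SCORES and category_averages are illustrative only.

```python
from statistics import mean

# Scores copied from the benchmark lists on this page.
SCORES = {
    "knowledge": {"MMLU": 56, "GPQA": 56, "SuperGPQA": 54, "OpenBookQA": 52},
    "coding": {"HumanEval": 48},
    "mathematics": {
        "AIME 2023": 56, "AIME 2024": 58, "AIME 2025": 57,
        "HMMT Feb 2023": 52, "HMMT Feb 2024": 54, "HMMT Feb 2025": 53,
        "BRUMO 2025": 55,
    },
    "reasoning": {"SimpleQA": 54, "MuSR": 52},
}


def category_averages(scores):
    """Return the unweighted mean score for each category (assumed method)."""
    return {category: mean(results.values()) for category, results in scores.items()}


averages = category_averages(SCORES)
for category, average in averages.items():
    print(f"{category}: {average:g}")
```

Under that assumption the computed means match the quoted figures of 54.5, 48, 55, and 53. The sketch does not attempt to reproduce the overall score of 46, because the page does not say how that number is derived.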

What is the context window size of Claude 3 Haiku?

Claude 3 Haiku has a context window of 200K tokens, which determines how much text it can process in a single interaction.
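
To make the 200K figure more concrete, here is a rough sketch of checking whether a piece of text is likely to fit in the window. It rests on several assumptions that are not from this page: "200K" is read as 200,000 tokens, English text is approximated at about 4 characters per token, and the reserved_for_output default is a hypothetical budget for the model's reply. A real tokenizer will give different counts, so treat the result as an estimate only.

```python
CONTEXT_WINDOW_TOKENS = 200_000  # "200K" from this page, read as 200,000 (assumption)
CHARS_PER_TOKEN = 4              # rough heuristic for English text (assumption)


def estimate_tokens(text: str) -> int:
    """Estimate the token count from the character count, rounding up."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_in_context(text: str, reserved_for_output: int = 4_096) -> bool:
    """Return True if the estimated prompt size plus the reply budget fits the window."""
    return estimate_tokens(text) + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    sample = "word " * 50_000  # 250,000 characters of placeholder text
    print(estimate_tokens(sample), fits_in_context(sample))
```

Reserving part of the window for the reply reflects that the prompt and the generated output share the same token budget in a single interaction.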