Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Claude 3 Haiku wins overall with a score of 46 vs 39 (7 point difference).Claude 3 Haiku wins 4 out of 4 categories.
Claude 3 Haiku
54.5
Llama 4 Behemoth
45.8
Claude 3 Haiku
48
Llama 4 Behemoth
40
Claude 3 Haiku
55
Llama 4 Behemoth
47
Claude 3 Haiku
53
Llama 4 Behemoth
45
Claude 3 Haiku scores higher overall with 46 vs 39, a difference of 7 points across all benchmarks.
Claude 3 Haiku leads in knowledge tasks with an average score of 54.5 vs 45.8.
Claude 3 Haiku leads in coding with an average score of 48 vs 40.
Claude 3 Haiku leads in math with an average score of 55 vs 47.
Claude 3 Haiku leads in reasoning with an average score of 53 vs 45.