Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Claude 3 Haiku wins overall with a score of 46 vs 42 (a 4-point difference). Claude 3 Haiku wins 4 out of 4 categories.
Knowledge: Claude 3 Haiku 54.5, GPT-OSS 120B 48.8
Coding: Claude 3 Haiku 48, GPT-OSS 120B 43
Math: Claude 3 Haiku 55, GPT-OSS 120B 50
Reasoning: Claude 3 Haiku 53, GPT-OSS 120B 48
Claude 3 Haiku scores higher overall with 46 vs 42, a difference of 4 points across all benchmarks.
Claude 3 Haiku leads in knowledge tasks with an average score of 54.5 vs 48.8.
Claude 3 Haiku leads in coding with an average score of 48 vs 43.
Claude 3 Haiku leads in math with an average score of 55 vs 50.
Claude 3 Haiku leads in reasoning with an average score of 53 vs 48.
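The per-category margins and the "4 out of 4" tally can be checked with a few lines of arithmetic. The following is a minimal sketch in Python; the dictionary layout and the function names `category_margins` and `count_wins` are illustrative assumptions, and the numbers are the category averages quoted above. The overall scores (46 vs 42) are not recomputed, since the text does not state how they are aggregated.

```python
# Category averages as quoted in the text: (Claude 3 Haiku, GPT-OSS 120B).
scores = {
    "knowledge": (54.5, 48.8),
    "coding": (48.0, 43.0),
    "math": (55.0, 50.0),
    "reasoning": (53.0, 48.0),
}


def category_margins(table):
    """Return {category: first score minus second score}, rounded to 1 decimal."""
    return {name: round(a - b, 1) for name, (a, b) in table.items()}


def count_wins(table):
    """Count the categories where the first model scores strictly higher."""
    return sum(1 for a, b in table.values() if a > b)


margins = category_margins(scores)
wins = count_wins(scores)

for name, margin in margins.items():
    print(f"{name}: Claude 3 Haiku leads by {margin} points")
print(f"Claude 3 Haiku leads in {wins} of {len(scores)} categories")
```

Rounding to one decimal keeps floating-point noise out of the knowledge margin.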