Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Claude Sonnet 4.5 wins overall with a score of 74 vs 42 (a 32-point difference). Claude Sonnet 4.5 wins 4 out of 4 categories.
Category averages (Claude Sonnet 4.5 vs GPT-OSS 120B):
Knowledge: 92 vs 48.8
Coding: 87 vs 43
Math: 96 vs 50
Reasoning: 90 vs 48
Claude Sonnet 4.5 scores higher overall with 74 vs 42, a difference of 32 points across all benchmarks.
Claude Sonnet 4.5 leads in knowledge tasks with an average score of 92 vs 48.8.
Claude Sonnet 4.5 leads in coding with an average score of 87 vs 43.
Claude Sonnet 4.5 leads in math with an average score of 96 vs 50.
Claude Sonnet 4.5 leads in reasoning with an average score of 90 vs 48.
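The per-category gaps and the 4-of-4 win count can be checked with a few lines of arithmetic. The sketch below is only an illustration: the `scores` dictionary and the `summarize` helper are hypothetical names, and the values are the category averages quoted above. It does not try to reproduce the overall 74 vs 42 score, because the page does not say how that figure is aggregated (the unweighted means of the four category averages would be 91.25 and 47.45).

```python
# Category averages as quoted in the comparison:
# first value = Claude Sonnet 4.5, second value = GPT-OSS 120B.
scores = {
    "knowledge": (92.0, 48.8),
    "coding": (87.0, 43.0),
    "math": (96.0, 50.0),
    "reasoning": (90.0, 48.0),
}


def summarize(scores):
    """Return per-category gaps and how many categories the first model wins."""
    gaps = {name: round(a - b, 1) for name, (a, b) in scores.items()}
    wins = sum(1 for gap in gaps.values() if gap > 0)
    return gaps, wins


gaps, wins = summarize(scores)
for name, gap in gaps.items():
    print(f"{name}: {gap} points")
print(f"categories won: {wins} of {len(scores)}")
```

On these numbers the gap is similar in every category, between 42 and 46 points, so no single category drives the lead.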