Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Claude Sonnet 4.6 wins overall with a score of 80 vs 76 for GPT-5.1 (a 4-point difference). Claude Sonnet 4.6 wins 3 out of 4 categories and ties the fourth, math.
Category     Claude Sonnet 4.6   GPT-5.1
Knowledge    96                  94
Coding       93                  89
Math         97.1                97.1
Reasoning    94                  92
Claude Sonnet 4.6 scores higher overall with 80 vs 76, a difference of 4 points across all benchmarks.
Claude Sonnet 4.6 leads in knowledge tasks with an average score of 96 vs 94.
Claude Sonnet 4.6 leads in coding with an average score of 93 vs 89.
Claude Sonnet 4.6 and GPT-5.1 are tied in math, each with an average score of 97.1.
Claude Sonnet 4.6 leads in reasoning with an average score of 94 vs 92.
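The category tally above can be checked with a short script. This is only a sketch: the dictionary layout and the helper names (`category_winner`, `tally_wins`) are illustrative assumptions, and it reproduces just the per-category margins and win count. It does not attempt to reproduce the overall 80 vs 76 score, since the page does not state how that figure is computed.

```python
# Category averages copied from the comparison above.
scores = {
    "knowledge": {"Claude Sonnet 4.6": 96.0, "GPT-5.1": 94.0},
    "coding": {"Claude Sonnet 4.6": 93.0, "GPT-5.1": 89.0},
    "math": {"Claude Sonnet 4.6": 97.1, "GPT-5.1": 97.1},
    "reasoning": {"Claude Sonnet 4.6": 94.0, "GPT-5.1": 92.0},
}


def category_winner(results):
    """Return the leading model's name, or None when the top two scores tie."""
    ranked = sorted(results.items(), key=lambda item: item[1], reverse=True)
    (top_name, top_score), (_, second_score) = ranked[0], ranked[1]
    return None if top_score == second_score else top_name


def tally_wins(all_scores):
    """Count outright category wins per model; ties count for neither."""
    wins = {}
    for results in all_scores.values():
        winner = category_winner(results)
        if winner is not None:
            wins[winner] = wins.get(winner, 0) + 1
    return wins


winners = {category: category_winner(r) for category, r in scores.items()}
wins = tally_wins(scores)

# Margin of Claude Sonnet 4.6 over GPT-5.1 in each category, in points.
margins = {
    category: round(r["Claude Sonnet 4.6"] - r["GPT-5.1"], 1)
    for category, r in scores.items()
}

for category in scores:
    leader = winners[category] or "tie"
    print(f"{category:<10} leader: {leader:<18} margin: {margins[category]:+.1f}")
print("category wins:", wins)
```

Treating a tie as a win for neither model matches the "3 out of 4 categories" count, with math as the tied category.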