Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Claude 4.1 Opus wins overall with a score of 61 vs 26 (a 35-point difference) and wins 4 out of 4 categories.
Knowledge: Claude 4.1 Opus 74.5, GLM-4.5-Air 32.8
Coding: Claude 4.1 Opus 68, GLM-4.5-Air 27
Math: Claude 4.1 Opus 75, GLM-4.5-Air 34
Reasoning: Claude 4.1 Opus 73, GLM-4.5-Air 32
Claude 4.1 Opus scores higher overall with 61 vs 26, a difference of 35 points across all benchmarks.
Claude 4.1 Opus leads in knowledge tasks with an average score of 74.5 vs 32.8.
Claude 4.1 Opus leads in coding with an average score of 68 vs 27.
Claude 4.1 Opus leads in math with an average score of 75 vs 34.
Claude 4.1 Opus leads in reasoning with an average score of 73 vs 32.
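The per-category margins and the 4-out-of-4 win count can be checked with a few lines of Python. This is only a sketch: the helper names `category_margins` and `count_wins` are made up for illustration, and the scores are copied from the category averages above. It does not try to reproduce the overall 61 vs 26 figure, because the comparison does not say how the overall score is weighted.

```python
# Category averages from the comparison above,
# stored as (Claude 4.1 Opus, GLM-4.5-Air).
scores = {
    "knowledge": (74.5, 32.8),
    "coding": (68, 27),
    "math": (75, 34),
    "reasoning": (73, 32),
}


def category_margins(scores):
    """Return the point gap (first model minus second) per category."""
    # Round to one decimal to avoid floating-point noise such as 41.70000000000001.
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


def count_wins(scores):
    """Count the categories in which the first model scores higher."""
    return sum(1 for a, b in scores.values() if a > b)


margins = category_margins(scores)
wins = count_wins(scores)

for name, gap in margins.items():
    print(f"{name}: {gap} points")
print(f"categories won: {wins} of {len(scores)}")
```

On these numbers the gap is roughly 41 points in every category, which is larger than the 35-point overall difference, so the overall score is presumably not a plain average of the four categories.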