GLM-4.5-Air vs Z-1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Z-1 wins overall with a score of 43 vs 26 (17 point difference).Z-1 wins 4 out of 4 categories.

Knowledge

Z-1

GLM-4.5-Air

32.8

Z-1

49.8

35
MMLU
52
34
GPQA
51
32
SuperGPQA
49
30
OpenBookQA
47

Coding

Z-1

GLM-4.5-Air

27

Z-1

44

27
HumanEval
44

Mathematics

Z-1

GLM-4.5-Air

34

Z-1

51

35
AIME 2023
52
37
AIME 2024
54
36
AIME 2025
53
31
HMMT Feb 2023
48
33
HMMT Feb 2024
50
32
HMMT Feb 2025
49
34
BRUMO 2025
51

Reasoning

Z-1

GLM-4.5-Air

32

Z-1

49

33
SimpleQA
50
31
MuSR
48

Frequently Asked Questions

Which is better, GLM-4.5-Air or Z-1?

Z-1 scores higher overall with 43 vs 26, a difference of 17 points across all benchmarks.

Which is better for knowledge tasks, GLM-4.5-Air or Z-1?

Z-1 leads in knowledge tasks with an average score of 49.8 vs 32.8.

Which is better for coding, GLM-4.5-Air or Z-1?

Z-1 leads in coding with an average score of 44 vs 27.

Which is better for math, GLM-4.5-Air or Z-1?

Z-1 leads in math with an average score of 51 vs 34.

Which is better for reasoning, GLM-4.5-Air or Z-1?

Z-1 leads in reasoning with an average score of 49 vs 32.