GLM-5 (Reasoning) vs Moonshot v1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

GLM-5 (Reasoning) wins overall with a score of 75 vs 44 (31 point difference).GLM-5 (Reasoning) wins 4 out of 4 categories.

Knowledge

GLM-5 (Reasoning)

GLM-5 (Reasoning)

93

Moonshot v1

50.8

96
MMLU
53
94
GPQA
52
92
SuperGPQA
50
90
OpenBookQA
48

Coding

GLM-5 (Reasoning)

GLM-5 (Reasoning)

88

Moonshot v1

45

88
HumanEval
45

Mathematics

GLM-5 (Reasoning)

GLM-5 (Reasoning)

96.6

Moonshot v1

52

98
AIME 2023
53
99
AIME 2024
55
98
AIME 2025
54
94
HMMT Feb 2023
49
96
HMMT Feb 2024
51
95
HMMT Feb 2025
50
96
BRUMO 2025
52

Reasoning

GLM-5 (Reasoning)

GLM-5 (Reasoning)

91

Moonshot v1

50

92
SimpleQA
51
90
MuSR
49

Frequently Asked Questions

Which is better, GLM-5 (Reasoning) or Moonshot v1?

GLM-5 (Reasoning) scores higher overall with 75 vs 44, a difference of 31 points across all benchmarks.

Which is better for knowledge tasks, GLM-5 (Reasoning) or Moonshot v1?

GLM-5 (Reasoning) leads in knowledge tasks with an average score of 93 vs 50.8.

Which is better for coding, GLM-5 (Reasoning) or Moonshot v1?

GLM-5 (Reasoning) leads in coding with an average score of 88 vs 45.

Which is better for math, GLM-5 (Reasoning) or Moonshot v1?

GLM-5 (Reasoning) leads in math with an average score of 96.6 vs 52.

Which is better for reasoning, GLM-5 (Reasoning) or Moonshot v1?

GLM-5 (Reasoning) leads in reasoning with an average score of 91 vs 50.