GLM-5 vs o3

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

GLM-5 wins overall with a score of 68 vs 67 (1 point difference).GLM-5 wins 1 out of 4 categories.

Knowledge

o3

GLM-5

85

o3

85.3

88
MMLU
86
86
GPQA
87
84
SuperGPQA
85
82
OpenBookQA
83

Coding

GLM-5

GLM-5

80

o3

78

80
HumanEval
78

Mathematics

Tie

GLM-5

87

o3

87

88
AIME 2023
88
90
AIME 2024
90
89
AIME 2025
89
84
HMMT Feb 2023
84
86
HMMT Feb 2024
86
85
HMMT Feb 2025
85
87
BRUMO 2025
87

Reasoning

Tie

GLM-5

83

o3

83

84
SimpleQA
84
82
MuSR
82

Frequently Asked Questions

Which is better, GLM-5 or o3?

GLM-5 scores higher overall with 68 vs 67, a difference of 1 points across all benchmarks.

Which is better for knowledge tasks, GLM-5 or o3?

o3 leads in knowledge tasks with an average score of 85.3 vs 85.

Which is better for coding, GLM-5 or o3?

GLM-5 leads in coding with an average score of 80 vs 78.

Which is better for math, GLM-5 or o3?

GLM-5 and o3 are tied for math with average scores of 87.

Which is better for reasoning, GLM-5 or o3?

GLM-5 and o3 are tied for reasoning with average scores of 83.