GLM-4.5 vs GPT-4 Turbo

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

GPT-4 Turbo wins overall with a score of 50 vs 28 (22 point difference).GPT-4 Turbo wins 4 out of 4 categories.

Knowledge

GPT-4 Turbo

GLM-4.5

34.8

GPT-4 Turbo

58.5

37
MMLU
60
36
GPQA
60
34
SuperGPQA
58
32
OpenBookQA
56

Coding

GPT-4 Turbo

GLM-4.5

29

GPT-4 Turbo

52

29
HumanEval
52

Mathematics

GPT-4 Turbo

GLM-4.5

36

GPT-4 Turbo

59

37
AIME 2023
60
39
AIME 2024
62
38
AIME 2025
61
33
HMMT Feb 2023
56
35
HMMT Feb 2024
58
34
HMMT Feb 2025
57
36
BRUMO 2025
59

Reasoning

GPT-4 Turbo

GLM-4.5

34

GPT-4 Turbo

57

35
SimpleQA
58
33
MuSR
56

Frequently Asked Questions

Which is better, GLM-4.5 or GPT-4 Turbo?

GPT-4 Turbo scores higher overall with 50 vs 28, a difference of 22 points across all benchmarks.

Which is better for knowledge tasks, GLM-4.5 or GPT-4 Turbo?

GPT-4 Turbo leads in knowledge tasks with an average score of 58.5 vs 34.8.

Which is better for coding, GLM-4.5 or GPT-4 Turbo?

GPT-4 Turbo leads in coding with an average score of 52 vs 29.

Which is better for math, GLM-4.5 or GPT-4 Turbo?

GPT-4 Turbo leads in math with an average score of 59 vs 36.

Which is better for reasoning, GLM-4.5 or GPT-4 Turbo?

GPT-4 Turbo leads in reasoning with an average score of 57 vs 34.