DeepSeek Coder 2.0 vs GLM-4.5-Air

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

DeepSeek Coder 2.0 wins overall with a score of 64 vs 26 (a 38-point difference). It also wins all 4 categories.

Knowledge

Benchmark     DeepSeek Coder 2.0   GLM-4.5-Air
Average       77.8                 32.8
MMLU          80                   35
GPQA          79                   34
SuperGPQA     77                   32
OpenBookQA    75                   30

Coding

Benchmark     DeepSeek Coder 2.0   GLM-4.5-Air
Average       82                   27
HumanEval     82                   27

Mathematics

Benchmark       DeepSeek Coder 2.0   GLM-4.5-Air
Average         80                   34
AIME 2023       81                   35
AIME 2024       83                   37
AIME 2025       82                   36
HMMT Feb 2023   77                   31
HMMT Feb 2024   79                   33
HMMT Feb 2025   78                   32
BRUMO 2025      80                   34

Reasoning

Benchmark     DeepSeek Coder 2.0   GLM-4.5-Air
Average       77                   32
SimpleQA      78                   33
MuSR          76                   31

Frequently Asked Questions

Which is better, DeepSeek Coder 2.0 or GLM-4.5-Air?

DeepSeek Coder 2.0 scores higher overall with 64 vs 26, a difference of 38 points across all benchmarks.

Which is better for knowledge tasks, DeepSeek Coder 2.0 or GLM-4.5-Air?

DeepSeek Coder 2.0 leads in knowledge tasks with an average score of 77.8 vs 32.8.

Which is better for coding, DeepSeek Coder 2.0 or GLM-4.5-Air?

DeepSeek Coder 2.0 leads in coding with an average score of 82 vs 27.

Which is better for math, DeepSeek Coder 2.0 or GLM-4.5-Air?

DeepSeek Coder 2.0 leads in math with an average score of 80 vs 34.

Which is better for reasoning, DeepSeek Coder 2.0 or GLM-4.5-Air?

DeepSeek Coder 2.0 leads in reasoning with an average score of 77 vs 32.
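
The category averages quoted in these answers can be checked against the per-benchmark scores listed on this page. The sketch below is only an illustration: it assumes each category score is the unweighted mean of that category's benchmarks, rounded to one decimal place. The names SCORES and category_averages are hypothetical helpers, not part of any published tool. The page does not state how the overall scores (64 vs 26) are derived, so the sketch does not attempt to reproduce them.

```python
# Per-benchmark scores copied from this page.
# Each entry maps benchmark -> (DeepSeek Coder 2.0, GLM-4.5-Air).
SCORES = {
    "Knowledge": {
        "MMLU": (80, 35),
        "GPQA": (79, 34),
        "SuperGPQA": (77, 32),
        "OpenBookQA": (75, 30),
    },
    "Coding": {
        "HumanEval": (82, 27),
    },
    "Mathematics": {
        "AIME 2023": (81, 35),
        "AIME 2024": (83, 37),
        "AIME 2025": (82, 36),
        "HMMT Feb 2023": (77, 31),
        "HMMT Feb 2024": (79, 33),
        "HMMT Feb 2025": (78, 32),
        "BRUMO 2025": (80, 34),
    },
    "Reasoning": {
        "SimpleQA": (78, 33),
        "MuSR": (76, 31),
    },
}


def category_averages(scores):
    """Return {category: (deepseek_avg, glm_avg)}.

    Assumption: a category score is the unweighted mean of its
    benchmarks, rounded to one decimal place.
    """
    result = {}
    for category, benchmarks in scores.items():
        pairs = list(benchmarks.values())
        deepseek_avg = sum(p[0] for p in pairs) / len(pairs)
        glm_avg = sum(p[1] for p in pairs) / len(pairs)
        result[category] = (round(deepseek_avg, 1), round(glm_avg, 1))
    return result


if __name__ == "__main__":
    for category, (deepseek_avg, glm_avg) in category_averages(SCORES).items():
        print(f"{category}: {deepseek_avg} vs {glm_avg}")
```

Under that assumption, the computed means match the category averages reported above: 77.8 vs 32.8 for knowledge, 82 vs 27 for coding, 80 vs 34 for math, and 77 vs 32 for reasoning.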