Gemma 3 27B vs GLM-4.5-Air

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Gemma 3 27B wins overall with a score of 36 vs 26 (10 point difference).Gemma 3 27B wins 4 out of 4 categories.

Knowledge

Gemma 3 27B

Gemma 3 27B

42.8

GLM-4.5-Air

32.8

45
MMLU
35
44
GPQA
34
42
SuperGPQA
32
40
OpenBookQA
30

Coding

Gemma 3 27B

Gemma 3 27B

37

GLM-4.5-Air

27

37
HumanEval
27

Mathematics

Gemma 3 27B

Gemma 3 27B

44

GLM-4.5-Air

34

45
AIME 2023
35
47
AIME 2024
37
46
AIME 2025
36
41
HMMT Feb 2023
31
43
HMMT Feb 2024
33
42
HMMT Feb 2025
32
44
BRUMO 2025
34

Reasoning

Gemma 3 27B

Gemma 3 27B

42

GLM-4.5-Air

32

43
SimpleQA
33
41
MuSR
31

Frequently Asked Questions

Which is better, Gemma 3 27B or GLM-4.5-Air?

Gemma 3 27B scores higher overall with 36 vs 26, a difference of 10 points across all benchmarks.

Which is better for knowledge tasks, Gemma 3 27B or GLM-4.5-Air?

Gemma 3 27B leads in knowledge tasks with an average score of 42.8 vs 32.8.

Which is better for coding, Gemma 3 27B or GLM-4.5-Air?

Gemma 3 27B leads in coding with an average score of 37 vs 27.

Which is better for math, Gemma 3 27B or GLM-4.5-Air?

Gemma 3 27B leads in math with an average score of 44 vs 34.

Which is better for reasoning, Gemma 3 27B or GLM-4.5-Air?

Gemma 3 27B leads in reasoning with an average score of 42 vs 32.