GLM-4.5-Air vs Llama 4 Scout

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Llama 4 Scout wins overall with a score of 38 vs 26 (12 point difference).Llama 4 Scout wins 4 out of 4 categories.

Knowledge

Llama 4 Scout

GLM-4.5-Air

32.8

Llama 4 Scout

44.8

35
MMLU
47
34
GPQA
46
32
SuperGPQA
44
30
OpenBookQA
42

Coding

Llama 4 Scout

GLM-4.5-Air

27

Llama 4 Scout

39

27
HumanEval
39

Mathematics

Llama 4 Scout

GLM-4.5-Air

34

Llama 4 Scout

46

35
AIME 2023
47
37
AIME 2024
49
36
AIME 2025
48
31
HMMT Feb 2023
43
33
HMMT Feb 2024
45
32
HMMT Feb 2025
44
34
BRUMO 2025
46

Reasoning

Llama 4 Scout

GLM-4.5-Air

32

Llama 4 Scout

44

33
SimpleQA
45
31
MuSR
43

Frequently Asked Questions

Which is better, GLM-4.5-Air or Llama 4 Scout?

Llama 4 Scout scores higher overall with 38 vs 26, a difference of 12 points across all benchmarks.

Which is better for knowledge tasks, GLM-4.5-Air or Llama 4 Scout?

Llama 4 Scout leads in knowledge tasks with an average score of 44.8 vs 32.8.

Which is better for coding, GLM-4.5-Air or Llama 4 Scout?

Llama 4 Scout leads in coding with an average score of 39 vs 27.

Which is better for math, GLM-4.5-Air or Llama 4 Scout?

Llama 4 Scout leads in math with an average score of 46 vs 34.

Which is better for reasoning, GLM-4.5-Air or Llama 4 Scout?

Llama 4 Scout leads in reasoning with an average score of 44 vs 32.