Llama 4 Maverick vs Z-1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Z-1 wins overall with a score of 43 vs 37 (6 point difference).Z-1 wins 4 out of 4 categories.

Knowledge

Z-1

Llama 4 Maverick

43.8

Z-1

49.8

46
MMLU
52
45
GPQA
51
43
SuperGPQA
49
41
OpenBookQA
47

Coding

Z-1

Llama 4 Maverick

38

Z-1

44

38
HumanEval
44

Mathematics

Z-1

Llama 4 Maverick

45

Z-1

51

46
AIME 2023
52
48
AIME 2024
54
47
AIME 2025
53
42
HMMT Feb 2023
48
44
HMMT Feb 2024
50
43
HMMT Feb 2025
49
45
BRUMO 2025
51

Reasoning

Z-1

Llama 4 Maverick

43

Z-1

49

44
SimpleQA
50
42
MuSR
48

Frequently Asked Questions

Which is better, Llama 4 Maverick or Z-1?

Z-1 scores higher overall with 43 vs 37, a difference of 6 points across all benchmarks.

Which is better for knowledge tasks, Llama 4 Maverick or Z-1?

Z-1 leads in knowledge tasks with an average score of 49.8 vs 43.8.

Which is better for coding, Llama 4 Maverick or Z-1?

Z-1 leads in coding with an average score of 44 vs 38.

Which is better for math, Llama 4 Maverick or Z-1?

Z-1 leads in math with an average score of 51 vs 45.

Which is better for reasoning, Llama 4 Maverick or Z-1?

Z-1 leads in reasoning with an average score of 49 vs 43.