Moonshot v1 vs Z-1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Moonshot v1 wins overall with a score of 44 vs 43, a 1-point difference. Moonshot v1 leads in all 4 categories.

Knowledge

Benchmark           Moonshot v1    Z-1
Category average    50.8           49.8
MMLU                53             52
GPQA                52             51
SuperGPQA           50             49
OpenBookQA          48             47

Coding

Benchmark           Moonshot v1    Z-1
Category average    45             44
HumanEval           45             44

Mathematics

Benchmark           Moonshot v1    Z-1
Category average    52             51
AIME 2023           53             52
AIME 2024           55             54
AIME 2025           54             53
HMMT Feb 2023       49             48
HMMT Feb 2024       51             50
HMMT Feb 2025       50             49
BRUMO 2025          52             51

Reasoning

Benchmark           Moonshot v1    Z-1
Category average    50             49
SimpleQA            51             50
MuSR                49             48

Frequently Asked Questions

Which is better, Moonshot v1 or Z-1?

Moonshot v1 scores higher overall with 44 vs 43, a difference of 1 point across all benchmarks.

Which is better for knowledge tasks, Moonshot v1 or Z-1?

Moonshot v1 leads in knowledge tasks with an average score of 50.8 vs 49.8.

Which is better for coding, Moonshot v1 or Z-1?

Moonshot v1 leads in coding with an average score of 45 vs 44.

Which is better for math, Moonshot v1 or Z-1?

Moonshot v1 leads in math with an average score of 52 vs 51.

Which is better for reasoning, Moonshot v1 or Z-1?

Moonshot v1 leads in reasoning with an average score of 50 vs 49.
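
The category averages quoted on this page appear to be plain arithmetic means of the per-benchmark scores, rounded to one decimal place. The page does not state its aggregation method, so treat that as an assumption. The Python sketch below recomputes the averages under it, using the per-benchmark scores listed in each category. The names `SCORES`, `category_average`, and `summarize` are illustrative only. The overall 44 vs 43 score is not the mean of the four category averages, so its formula is unknown and the sketch does not try to reproduce it.

```python
from decimal import Decimal, ROUND_HALF_UP

# Per-benchmark scores copied from this page, as (Moonshot v1, Z-1) pairs.
SCORES = {
    "Knowledge": {
        "MMLU": (53, 52),
        "GPQA": (52, 51),
        "SuperGPQA": (50, 49),
        "OpenBookQA": (48, 47),
    },
    "Coding": {"HumanEval": (45, 44)},
    "Mathematics": {
        "AIME 2023": (53, 52),
        "AIME 2024": (55, 54),
        "AIME 2025": (54, 53),
        "HMMT Feb 2023": (49, 48),
        "HMMT Feb 2024": (51, 50),
        "HMMT Feb 2025": (50, 49),
        "BRUMO 2025": (52, 51),
    },
    "Reasoning": {"SimpleQA": (51, 50), "MuSR": (49, 48)},
}


def category_average(benchmarks, index):
    """Assumed aggregation: arithmetic mean, rounded half-up to one decimal."""
    values = [pair[index] for pair in benchmarks.values()]
    mean = Decimal(sum(values)) / Decimal(len(values))
    return mean.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


def summarize(scores):
    """Return {category: (Moonshot v1 average, Z-1 average)}."""
    return {
        category: (category_average(b, 0), category_average(b, 1))
        for category, b in scores.items()
    }


if __name__ == "__main__":
    for category, (moonshot, z1) in summarize(SCORES).items():
        if moonshot > z1:
            leader = "Moonshot v1 leads"
        elif z1 > moonshot:
            leader = "Z-1 leads"
        else:
            leader = "tie"
        print(f"{category}: {moonshot} vs {z1} ({leader})")
```

Under this assumption the recomputed values match the category averages shown on the page (for example 50.8 vs 49.8 for knowledge). Every gap is 1 point, both per benchmark and per category.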