GPT-5 (medium) vs Moonshot v1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

GPT-5 (medium) wins overall with a score of 70 vs 44 (26 point difference).GPT-5 (medium) wins 4 out of 4 categories.

Knowledge

GPT-5 (medium)

GPT-5 (medium)

88

Moonshot v1

50.8

91
MMLU
53
89
GPQA
52
87
SuperGPQA
50
85
OpenBookQA
48

Coding

GPT-5 (medium)

GPT-5 (medium)

83

Moonshot v1

45

83
HumanEval
45

Mathematics

GPT-5 (medium)

GPT-5 (medium)

92

Moonshot v1

52

93
AIME 2023
53
95
AIME 2024
55
94
AIME 2025
54
89
HMMT Feb 2023
49
91
HMMT Feb 2024
51
90
HMMT Feb 2025
50
92
BRUMO 2025
52

Reasoning

GPT-5 (medium)

GPT-5 (medium)

86

Moonshot v1

50

87
SimpleQA
51
85
MuSR
49

Frequently Asked Questions

Which is better, GPT-5 (medium) or Moonshot v1?

GPT-5 (medium) scores higher overall with 70 vs 44, a difference of 26 points across all benchmarks.

Which is better for knowledge tasks, GPT-5 (medium) or Moonshot v1?

GPT-5 (medium) leads in knowledge tasks with an average score of 88 vs 50.8.

Which is better for coding, GPT-5 (medium) or Moonshot v1?

GPT-5 (medium) leads in coding with an average score of 83 vs 45.

Which is better for math, GPT-5 (medium) or Moonshot v1?

GPT-5 (medium) leads in math with an average score of 92 vs 52.

Which is better for reasoning, GPT-5 (medium) or Moonshot v1?

GPT-5 (medium) leads in reasoning with an average score of 86 vs 50.