Mistral 7B v0.3 vs Moonshot v1

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Moonshot v1 wins overall with a score of 44 vs 21 (a 23-point difference). Moonshot v1 wins 4 out of 4 categories.

Knowledge

Winner: Moonshot v1

Average score: Mistral 7B v0.3 27.8, Moonshot v1 50.8

Benchmark     Mistral 7B v0.3   Moonshot v1
MMLU          30                53
GPQA          29                52
SuperGPQA     27                50
OpenBookQA    25                48

Coding

Winner: Moonshot v1

Average score: Mistral 7B v0.3 22, Moonshot v1 45

Benchmark     Mistral 7B v0.3   Moonshot v1
HumanEval     22                45

Mathematics

Winner: Moonshot v1

Average score: Mistral 7B v0.3 29, Moonshot v1 52

Benchmark       Mistral 7B v0.3   Moonshot v1
AIME 2023       30                53
AIME 2024       32                55
AIME 2025       31                54
HMMT Feb 2023   26                49
HMMT Feb 2024   28                51
HMMT Feb 2025   27                50
BRUMO 2025      29                52

Reasoning

Winner: Moonshot v1

Average score: Mistral 7B v0.3 27, Moonshot v1 50

Benchmark     Mistral 7B v0.3   Moonshot v1
SimpleQA      28                51
MuSR          26                49
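
The category averages above appear to be plain unweighted means of the per-benchmark scores. The short Python sketch below recomputes them under that assumption. The names `SCORES` and `category_averages` are illustrative and not part of any published tooling. The overall 44 vs 21 figures are not recomputed, because the page does not state how they are aggregated.

```python
from statistics import mean

# Per-benchmark scores as (Mistral 7B v0.3, Moonshot v1), copied from this page.
SCORES = {
    "Knowledge": {
        "MMLU": (30, 53),
        "GPQA": (29, 52),
        "SuperGPQA": (27, 50),
        "OpenBookQA": (25, 48),
    },
    "Coding": {
        "HumanEval": (22, 45),
    },
    "Mathematics": {
        "AIME 2023": (30, 53),
        "AIME 2024": (32, 55),
        "AIME 2025": (31, 54),
        "HMMT Feb 2023": (26, 49),
        "HMMT Feb 2024": (28, 51),
        "HMMT Feb 2025": (27, 50),
        "BRUMO 2025": (29, 52),
    },
    "Reasoning": {
        "SimpleQA": (28, 51),
        "MuSR": (26, 49),
    },
}


def category_averages(scores):
    """Return {category: (mistral_avg, moonshot_avg)} as unweighted means."""
    result = {}
    for category, benchmarks in scores.items():
        mistral_avg = mean(m for m, _ in benchmarks.values())
        moonshot_avg = mean(k for _, k in benchmarks.values())
        result[category] = (mistral_avg, moonshot_avg)
    return result


averages = category_averages(SCORES)

# Count the categories in which Moonshot v1 has the higher average.
moonshot_wins = sum(1 for m, k in averages.values() if k > m)

for category, (mistral_avg, moonshot_avg) in averages.items():
    print(f"{category}: Mistral 7B v0.3 {mistral_avg:.2f}, Moonshot v1 {moonshot_avg:.2f}")
print(f"Moonshot v1 wins {moonshot_wins} of {len(averages)} categories")
```

The Knowledge means come out to 27.75 and 50.75, which match the 27.8 and 50.8 shown once rounded to one decimal place.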

Frequently Asked Questions

Which is better, Mistral 7B v0.3 or Moonshot v1?

Moonshot v1 scores higher overall with 44 vs 21, a difference of 23 points across all benchmarks.

Which is better for knowledge tasks, Mistral 7B v0.3 or Moonshot v1?

Moonshot v1 leads in knowledge tasks with an average score of 50.8 vs 27.8.

Which is better for coding, Mistral 7B v0.3 or Moonshot v1?

Moonshot v1 leads in coding with an average score of 45 vs 22.

Which is better for math, Mistral 7B v0.3 or Moonshot v1?

Moonshot v1 leads in math with an average score of 52 vs 29.

Which is better for reasoning, Mistral 7B v0.3 or Moonshot v1?

Moonshot v1 leads in reasoning with an average score of 50 vs 27.