GPT-4 Turbo vs MiMo-V2-Flash

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

MiMo-V2-Flash wins overall with a score of 63 vs 50 (13 point difference).MiMo-V2-Flash wins 4 out of 4 categories.

Knowledge

MiMo-V2-Flash

GPT-4 Turbo

58.5

MiMo-V2-Flash

76.8

60
MMLU
79
60
GPQA
78
58
SuperGPQA
76
56
OpenBookQA
74

Coding

MiMo-V2-Flash

GPT-4 Turbo

52

MiMo-V2-Flash

71

52
HumanEval
71

Mathematics

MiMo-V2-Flash

GPT-4 Turbo

59

MiMo-V2-Flash

78

60
AIME 2023
79
62
AIME 2024
81
61
AIME 2025
80
56
HMMT Feb 2023
75
58
HMMT Feb 2024
77
57
HMMT Feb 2025
76
59
BRUMO 2025
78

Reasoning

MiMo-V2-Flash

GPT-4 Turbo

57

MiMo-V2-Flash

75

58
SimpleQA
76
56
MuSR
74

Frequently Asked Questions

Which is better, GPT-4 Turbo or MiMo-V2-Flash?

MiMo-V2-Flash scores higher overall with 63 vs 50, a difference of 13 points across all benchmarks.

Which is better for knowledge tasks, GPT-4 Turbo or MiMo-V2-Flash?

MiMo-V2-Flash leads in knowledge tasks with an average score of 76.8 vs 58.5.

Which is better for coding, GPT-4 Turbo or MiMo-V2-Flash?

MiMo-V2-Flash leads in coding with an average score of 71 vs 52.

Which is better for math, GPT-4 Turbo or MiMo-V2-Flash?

MiMo-V2-Flash leads in math with an average score of 78 vs 59.

Which is better for reasoning, GPT-4 Turbo or MiMo-V2-Flash?

MiMo-V2-Flash leads in reasoning with an average score of 75 vs 57.