DeepSeek LLM 2.0 vs MiMo-V2-Flash

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Both models are tied with an overall score of 63.

Knowledge

Tie

DeepSeek LLM 2.0

76.8

MiMo-V2-Flash

76.8

79
MMLU
79
78
GPQA
78
76
SuperGPQA
76
74
OpenBookQA
74

Coding

DeepSeek LLM 2.0

DeepSeek LLM 2.0

73

MiMo-V2-Flash

71

73
HumanEval
71

Mathematics

DeepSeek LLM 2.0

DeepSeek LLM 2.0

79

MiMo-V2-Flash

78

80
AIME 2023
79
82
AIME 2024
81
81
AIME 2025
80
76
HMMT Feb 2023
75
78
HMMT Feb 2024
77
77
HMMT Feb 2025
76
79
BRUMO 2025
78

Reasoning

DeepSeek LLM 2.0

DeepSeek LLM 2.0

76

MiMo-V2-Flash

75

77
SimpleQA
76
75
MuSR
74

Frequently Asked Questions

Which is better, DeepSeek LLM 2.0 or MiMo-V2-Flash?

DeepSeek LLM 2.0 and MiMo-V2-Flash are tied with identical overall scores of 63.

Which is better for knowledge tasks, DeepSeek LLM 2.0 or MiMo-V2-Flash?

DeepSeek LLM 2.0 and MiMo-V2-Flash are tied for knowledge tasks with average scores of 76.8.

Which is better for coding, DeepSeek LLM 2.0 or MiMo-V2-Flash?

DeepSeek LLM 2.0 leads in coding with an average score of 73 vs 71.

Which is better for math, DeepSeek LLM 2.0 or MiMo-V2-Flash?

DeepSeek LLM 2.0 leads in math with an average score of 79 vs 78.

Which is better for reasoning, DeepSeek LLM 2.0 or MiMo-V2-Flash?

DeepSeek LLM 2.0 leads in reasoning with an average score of 76 vs 75.