DeepSeek Coder 2.0 Benchmark Scores & Performance

Benchmark analysis of DeepSeek Coder 2.0 by DeepSeek across 14 tests.

Creator

DeepSeek

Source Type

Open Weight

Reasoning

Non-Reasoning

Context Window

128K

Overall Score

64#35 of 88

Knowledge Benchmarks

MMLU
80
GPQA
79
SuperGPQA
77
OpenBookQA
75

Coding Benchmarks

HumanEval
82

Mathematics Benchmarks

AIME 2023
81
AIME 2024
83
AIME 2025
82
HMMT Feb 2023
77
HMMT Feb 2024
79
HMMT Feb 2025
78
BRUMO 2025
80

Reasoning Benchmarks

SimpleQA
78
MuSR
76

Frequently Asked Questions

How does DeepSeek Coder 2.0 perform overall in AI benchmarks?

DeepSeek Coder 2.0 ranks #35 out of 88 models with an overall score of 64. It is created by DeepSeek and features a 128K context window.

Is DeepSeek Coder 2.0 good for knowledge and understanding?

DeepSeek Coder 2.0 ranks #35 out of 88 models in knowledge and understanding benchmarks with an average score of 77.8. There are stronger options in this category.

Is DeepSeek Coder 2.0 good for coding and programming?

DeepSeek Coder 2.0 ranks #22 out of 88 models in coding and programming benchmarks with an average score of 82. There are stronger options in this category.

Is DeepSeek Coder 2.0 good for mathematics?

DeepSeek Coder 2.0 ranks #35 out of 88 models in mathematics benchmarks with an average score of 80. There are stronger options in this category.

Is DeepSeek Coder 2.0 good for reasoning and logic?

DeepSeek Coder 2.0 ranks #35 out of 88 models in reasoning and logic benchmarks with an average score of 77. There are stronger options in this category.

Is DeepSeek Coder 2.0 open source?

Yes, DeepSeek Coder 2.0 is an open weight model created by DeepSeek, meaning it can be downloaded and run locally or fine-tuned for specific use cases.

What is the context window size of DeepSeek Coder 2.0?

DeepSeek Coder 2.0 has a context window of 128K tokens, which determines how much text it can process in a single interaction.