Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Qwen2.5-1M wins overall with a score of 66 vs 64 (2 point difference).Qwen2.5-1M wins 3 out of 4 categories.
DeepSeek Coder 2.0
77.8
Qwen2.5-1M
81.8
DeepSeek Coder 2.0
82
Qwen2.5-1M
76
DeepSeek Coder 2.0
80
Qwen2.5-1M
84
DeepSeek Coder 2.0
77
Qwen2.5-1M
80
Qwen2.5-1M scores higher overall with 66 vs 64, a difference of 2 points across all benchmarks.
Qwen2.5-1M leads in knowledge tasks with an average score of 81.8 vs 77.8.
DeepSeek Coder 2.0 leads in coding with an average score of 82 vs 76.
Qwen2.5-1M leads in math with an average score of 84 vs 80.
Qwen2.5-1M leads in reasoning with an average score of 80 vs 77.