Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
DeepSeek V3.2 (Thinking) wins overall with a score of 69 vs 68 (a 1-point difference), even though it wins 0 out of 4 categories.
Category     DeepSeek V3.2 (Thinking)   o3-pro
Knowledge    84                         87.3
Coding       79                         80
Math         86                         89
Reasoning    82                         85
DeepSeek V3.2 (Thinking) scores higher overall with 69 vs 68, a difference of 1 point across all benchmarks.
o3-pro leads in knowledge tasks with an average score of 87.3 vs 84.
o3-pro leads in coding with an average score of 80 vs 79.
o3-pro leads in math with an average score of 89 vs 86.
o3-pro leads in reasoning with an average score of 85 vs 82.
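The overall score (69 vs 68) is evidently not a plain average of the four category scores, and the page does not state how it is aggregated. As a rough cross-check, the following Python sketch recomputes the category wins and an unweighted mean from the category numbers listed above. The function names are illustrative, and the unweighted mean is an assumption made for comparison, not the page's scoring method.

```python
# Category scores copied from the comparison above.
SCORES = {
    "knowledge": {"DeepSeek V3.2 (Thinking)": 84.0, "o3-pro": 87.3},
    "coding":    {"DeepSeek V3.2 (Thinking)": 79.0, "o3-pro": 80.0},
    "math":      {"DeepSeek V3.2 (Thinking)": 86.0, "o3-pro": 89.0},
    "reasoning": {"DeepSeek V3.2 (Thinking)": 82.0, "o3-pro": 85.0},
}


def category_wins(scores):
    """Count how many categories each model leads (hypothetical helper)."""
    wins = {}
    for per_model in scores.values():
        for model in per_model:
            wins.setdefault(model, 0)
        # The model with the highest score takes the category.
        best = max(per_model, key=per_model.get)
        wins[best] += 1
    return wins


def unweighted_mean(scores, model):
    """Plain average of a model's category scores (an assumed aggregation)."""
    values = [per_model[model] for per_model in scores.values()]
    return sum(values) / len(values)


if __name__ == "__main__":
    print(category_wins(SCORES))
    for name in ("DeepSeek V3.2 (Thinking)", "o3-pro"):
        print(name, round(unweighted_mean(SCORES, name), 2))
```

Under this assumed aggregation the means are 82.75 for DeepSeek V3.2 (Thinking) and about 85.3 for o3-pro, so the 69 vs 68 overall figure must come from a different weighting or from benchmarks not shown in the four category averages.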