Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
DeepSeek V3.2 (Thinking) wins overall with a score of 69 vs 68 (1 point difference).DeepSeek V3.2 (Thinking) wins 0 out of 4 categories.
DeepSeek V3.2 (Thinking)
84
GPT-5 mini
85
DeepSeek V3.2 (Thinking)
79
GPT-5 mini
80
DeepSeek V3.2 (Thinking)
86
GPT-5 mini
89
DeepSeek V3.2 (Thinking)
82
GPT-5 mini
83
DeepSeek V3.2 (Thinking) scores higher overall with 69 vs 68, a difference of 1 points across all benchmarks.
GPT-5 mini leads in knowledge tasks with an average score of 85 vs 84.
GPT-5 mini leads in coding with an average score of 80 vs 79.
GPT-5 mini leads in math with an average score of 89 vs 86.
GPT-5 mini leads in reasoning with an average score of 83 vs 82.