Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
Nemotron Ultra 253B wins overall with a score of 40 vs 22 (18 point difference).Nemotron Ultra 253B wins 4 out of 4 categories.
GPT-OSS 20B
28.8
Nemotron Ultra 253B
46.8
GPT-OSS 20B
23
Nemotron Ultra 253B
41
GPT-OSS 20B
30
Nemotron Ultra 253B
48
GPT-OSS 20B
28
Nemotron Ultra 253B
46
Nemotron Ultra 253B scores higher overall with 40 vs 22, a difference of 18 points across all benchmarks.
Nemotron Ultra 253B leads in knowledge tasks with an average score of 46.8 vs 28.8.
Nemotron Ultra 253B leads in coding with an average score of 41 vs 23.
Nemotron Ultra 253B leads in math with an average score of 48 vs 30.
Nemotron Ultra 253B leads in reasoning with an average score of 46 vs 28.