Nemotron Ultra 253B Benchmark Scores & Performance

Benchmark analysis of Nemotron Ultra 253B by NVIDIA across 14 tests.

Creator

NVIDIA

Source Type

Open Weight

Reasoning

Reasoning

Context Window

32K

Overall Score

40#68 of 88

Knowledge Benchmarks

MMLU
49
GPQA
48
SuperGPQA
46
OpenBookQA
44

Coding Benchmarks

HumanEval
41

Mathematics Benchmarks

AIME 2023
49
AIME 2024
51
AIME 2025
50
HMMT Feb 2023
45
HMMT Feb 2024
47
HMMT Feb 2025
46
BRUMO 2025
48

Reasoning Benchmarks

SimpleQA
47
MuSR
45

Frequently Asked Questions

How does Nemotron Ultra 253B perform overall in AI benchmarks?

Nemotron Ultra 253B ranks #68 out of 88 models with an overall score of 40. It is created by NVIDIA and features a 32K context window.

Is Nemotron Ultra 253B good for knowledge and understanding?

Nemotron Ultra 253B ranks #68 out of 88 models in knowledge and understanding benchmarks with an average score of 46.8. There are stronger options in this category.

Is Nemotron Ultra 253B good for coding and programming?

Nemotron Ultra 253B ranks #68 out of 88 models in coding and programming benchmarks with an average score of 41. There are stronger options in this category.

Is Nemotron Ultra 253B good for mathematics?

Nemotron Ultra 253B ranks #68 out of 88 models in mathematics benchmarks with an average score of 48. There are stronger options in this category.

Is Nemotron Ultra 253B good for reasoning and logic?

Nemotron Ultra 253B ranks #68 out of 88 models in reasoning and logic benchmarks with an average score of 46. There are stronger options in this category.

Is Nemotron Ultra 253B open source?

Yes, Nemotron Ultra 253B is an open weight model created by NVIDIA, meaning it can be downloaded and run locally or fine-tuned for specific use cases.

What is the context window size of Nemotron Ultra 253B?

Nemotron Ultra 253B has a context window of 32K tokens, which determines how much text it can process in a single interaction.