Nemotron 3 Ultra 500B Benchmark Scores & Performance

Benchmark analysis of Nemotron 3 Ultra 500B by NVIDIA across 14 tests.

Creator

NVIDIA

Source Type

Open Weight

Reasoning

Reasoning

Context Window

32K

Overall Score

60 (#42 of 88)

Knowledge Benchmarks

MMLU
74
GPQA
73
SuperGPQA
71
OpenBookQA
69

Coding Benchmarks

HumanEval
66

Mathematics Benchmarks

AIME 2023
74
AIME 2024
76
AIME 2025
75
HMMT Feb 2023
70
HMMT Feb 2024
72
HMMT Feb 2025
71
BRUMO 2025
73

Reasoning Benchmarks

SimpleQA
71
MuSR
69

Frequently Asked Questions

How does Nemotron 3 Ultra 500B perform overall in AI benchmarks?

Nemotron 3 Ultra 500B ranks #42 out of 88 models with an overall score of 60. It was developed by NVIDIA and has a 32K-token context window.

Is Nemotron 3 Ultra 500B good for knowledge and understanding?

Nemotron 3 Ultra 500B ranks #42 out of 88 models in knowledge and understanding benchmarks with an average score of 71.8. There are stronger options in this category.

Is Nemotron 3 Ultra 500B good for coding and programming?

Nemotron 3 Ultra 500B ranks #42 out of 88 models in coding and programming benchmarks with an average score of 66. There are stronger options in this category.

Is Nemotron 3 Ultra 500B good for mathematics?

Nemotron 3 Ultra 500B ranks #42 out of 88 models in mathematics benchmarks with an average score of 73. There are stronger options in this category.

Is Nemotron 3 Ultra 500B good for reasoning and logic?

Nemotron 3 Ultra 500B ranks #43 out of 88 models in reasoning and logic benchmarks with an average score of 70. There are stronger options in this category.
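The category averages quoted in these answers can be reproduced from the per-benchmark scores listed on this page. The following is a minimal Python sketch, assuming each category average is a simple unweighted mean; the page does not state its aggregation method, and the names `SCORES` and `category_averages` are illustrative only. The overall score of 60 is not the plain mean of the 14 scores, so it is presumably computed differently and is not reproduced here.

```python
from statistics import mean

# Scores copied from the benchmark listings on this page.
SCORES = {
    "knowledge": {"MMLU": 74, "GPQA": 73, "SuperGPQA": 71, "OpenBookQA": 69},
    "coding": {"HumanEval": 66},
    "mathematics": {
        "AIME 2023": 74,
        "AIME 2024": 76,
        "AIME 2025": 75,
        "HMMT Feb 2023": 70,
        "HMMT Feb 2024": 72,
        "HMMT Feb 2025": 71,
        "BRUMO 2025": 73,
    },
    "reasoning": {"SimpleQA": 71, "MuSR": 69},
}


def category_averages(scores):
    """Return the unweighted mean score for each category (an assumed method)."""
    return {category: mean(results.values()) for category, results in scores.items()}


averages = category_averages(SCORES)
for category, value in averages.items():
    print(f"{category}: {value:.2f}")
```

Under this assumption the knowledge mean is 71.75, which matches the quoted 71.8 after rounding to one decimal place; the coding, mathematics, and reasoning means come out to 66, 73, and 70.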

Is Nemotron 3 Ultra 500B open source?

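Nemotron 3 Ultra 500B is an open weight model created by NVIDIA: its weights can be downloaded to run locally or to fine-tune for specific use cases. Open weight is not necessarily the same as open source, and the permitted uses depend on the model's license.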

What is the context window size of Nemotron 3 Ultra 500B?

Nemotron 3 Ultra 500B has a context window of 32K tokens, which determines how much text it can process in a single interaction.
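To illustrate what a fixed context window means in practice, here is a rough Python sketch of a budget check. It rests on two assumptions that this page does not confirm: that "32K" means 32 × 1024 = 32,768 tokens (it could also mean 32,000), and that the prompt and the generated output share the same window. The function `fits_in_context` is hypothetical, and the token counts must come from whatever tokenizer the model actually uses.

```python
# Assumption: "32K" is read as 32 * 1024 tokens; the page does not specify.
CONTEXT_WINDOW_TOKENS = 32 * 1024


def fits_in_context(prompt_tokens, max_output_tokens, window=CONTEXT_WINDOW_TOKENS):
    """Return True if the prompt plus the reserved output budget fits in the window.

    Assumes prompt and output tokens count against the same window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


# 30,000 + 2,000 = 32,000 tokens, within the assumed 32,768-token window.
print(fits_in_context(30_000, 2_000))
# 31,000 + 2,000 = 33,000 tokens, which exceeds the assumed window.
print(fits_in_context(31_000, 2_000))
```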