Nemotron-4 15B Benchmark Scores & Performance

Benchmark analysis of Nemotron-4 15B by NVIDIA across 14 tests.

Creator: NVIDIA

Source Type: Open Weight

Reasoning: Non-Reasoning

Context Window: 32K

Overall Score: 45 (#63 of 88)

Knowledge Benchmarks

MMLU: 54
GPQA: 53
SuperGPQA: 51
OpenBookQA: 49

Coding Benchmarks

HumanEval: 46

Mathematics Benchmarks

AIME 2023: 54
AIME 2024: 56
AIME 2025: 55
HMMT Feb 2023: 50
HMMT Feb 2024: 52
HMMT Feb 2025: 51
BRUMO 2025: 53

Reasoning Benchmarks

SimpleQA: 52
MuSR: 50

Frequently Asked Questions

How does Nemotron-4 15B perform overall in AI benchmarks?

Nemotron-4 15B ranks #63 out of 88 models with an overall score of 45. It was created by NVIDIA and has a 32K-token context window.

Is Nemotron-4 15B good for knowledge and understanding?

Nemotron-4 15B ranks #63 out of 88 models in knowledge and understanding benchmarks with an average score of 51.8. There are stronger options in this category.

Is Nemotron-4 15B good for coding and programming?

Nemotron-4 15B ranks #63 out of 88 models in coding and programming benchmarks with an average score of 46. There are stronger options in this category.

Is Nemotron-4 15B good for mathematics?

Nemotron-4 15B ranks #63 out of 88 models in mathematics benchmarks with an average score of 53. There are stronger options in this category.

Is Nemotron-4 15B good for reasoning and logic?

Nemotron-4 15B ranks #63 out of 88 models in reasoning and logic benchmarks with an average score of 51. There are stronger options in this category.
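
The category averages quoted in the answers above can be reproduced from the individual scores listed on this page. The sketch below assumes each category average is a plain unweighted mean, which matches the quoted figures; the names SCORES and category_averages are illustrative and not part of any published tool. It does not attempt to reproduce the overall score of 45, since the page does not say how that number is derived.

```python
# Scores copied from the benchmark lists on this page.
SCORES = {
    "knowledge": {"MMLU": 54, "GPQA": 53, "SuperGPQA": 51, "OpenBookQA": 49},
    "coding": {"HumanEval": 46},
    "mathematics": {
        "AIME 2023": 54,
        "AIME 2024": 56,
        "AIME 2025": 55,
        "HMMT Feb 2023": 50,
        "HMMT Feb 2024": 52,
        "HMMT Feb 2025": 51,
        "BRUMO 2025": 53,
    },
    "reasoning": {"SimpleQA": 52, "MuSR": 50},
}


def category_averages(scores):
    """Return the unweighted mean score for each category (assumed method)."""
    return {
        category: sum(results.values()) / len(results)
        for category, results in scores.items()
    }


averages = category_averages(SCORES)
for category, average in averages.items():
    print(f"{category}: {average:.2f}")
```

The knowledge mean comes out at 51.75, which the page reports rounded to 51.8; the other three categories come out at whole numbers (46, 53, and 51).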

Is Nemotron-4 15B open source?

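Nemotron-4 15B is listed as an open weight model created by NVIDIA, meaning its weights can be downloaded and run locally or fine-tuned for specific use cases. Open weight does not necessarily mean open source, since the training code and data may not be released.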

What is the context window size of Nemotron-4 15B?

Nemotron-4 15B has a context window of 32K tokens, which determines how much text it can process in a single interaction.
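
As a rough illustration of what a 32K window means in practice, the sketch below estimates whether a prompt would fit. It rests on two assumptions that do not come from this page: that "32K" means 32 * 1024 tokens, and that English text averages about four characters per token. A real check would use the model's own tokenizer; the function names here are hypothetical.

```python
# Assumption: "32K" is read as 32 * 1024 tokens; the page only says "32K".
CONTEXT_WINDOW_TOKENS = 32 * 1024

# Assumption: ~4 characters per token, a crude heuristic for English text.
# This is not the model's tokenizer and can be off by a wide margin.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Estimate the token count from character length (ceiling division)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(prompt: str, reserved_for_output: int = 1024) -> bool:
    """Check whether the prompt plus a reserved output budget fits the window."""
    return estimate_tokens(prompt) + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    sample = "word " * 10_000  # 50,000 characters
    print(estimate_tokens(sample), fits_in_context(sample))
```

Reserving part of the window for the model's reply is a design choice in this sketch: the prompt and the generated output share the same context budget.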