Mistral Small 4 Benchmark Scores & Performance

BenchLM is tracking Mistral Small 4 by Mistral. Some benchmark data is visible, but trusted coverage is not complete enough for ranking yet.

BenchLM is tracking Mistral Small 4, but this profile is currently excluded from the trusted leaderboard because its source-backed benchmark coverage is not complete enough yet. We keep the model metadata and any verified benchmark rows visible while the rest of the public eval record is re-checked.

Mistral Small 4 is a open weight model with a 256K token context window. It processes queries without explicit chain-of-thought reasoning, offering faster response times and lower token usage.

Mistral Small 4 sits inside the Mistral Small 4 family alongside Mistral Small 4 (Reasoning). This profile currently has 2 trusted benchmark rows on BenchLM, but that is not enough for a leaderboard rank yet.

Creator

Mistral

Source Type

Open Weight

Reasoning

Non-Reasoning

Context Window

256K

Overall Score

Not ranked yet

Family & Lineage

Family

Mistral Small 4

Base entry

Rankings Overview

BenchLM is still verifying enough trusted benchmark coverage to place this model in the leaderboard. Category ranks will appear here once that source-backed coverage is complete.

Knowledge Benchmarks

GPQA
59.1%
MMLU-Pro
73.5%

Coding Benchmarks

LiveCodeBench
32%

Mathematics Benchmarks

AIME 2025
36%

Multimodal & Grounded Benchmarks

MMMU-Pro
46.3%

Frequently Asked Questions

How does Mistral Small 4 perform overall in AI benchmarks?

BenchLM is tracking Mistral Small 4, but trusted source-backed benchmark coverage is still coming soon. We currently list its creator, model type, and context window while we wait for verified public benchmark results.

Is Mistral Small 4 open source?

Yes, Mistral Small 4 is an open weight model created by Mistral, meaning it can be downloaded and run locally or fine-tuned for specific use cases.

Which sibling models are related to Mistral Small 4?

Mistral Small 4 belongs to the Mistral Small 4 family. Related variants on BenchLM include Mistral Small 4 (Reasoning).

Does Mistral Small 4 have full benchmark coverage on BenchLM?

Mistral Small 4 is tracked on BenchLM, but its current source-backed benchmark coverage is not strong enough for a trusted leaderboard rank yet. We keep the model page live while we verify more public benchmark results.

What is the context window size of Mistral Small 4?

Mistral Small 4 has a context window of 256K, which determines how much text it can process in a single interaction.

Last updated: March 17, 2026

Weekly LLM Updates

New model releases, benchmark scores, and leaderboard changes. Every Friday.

Free. Your signup is stored with a derived country code for compliance routing.