Skip to main content

MRCRv2

A long-context benchmark for memory, retrieval, and multi-round coherence over large contexts.

Top models on MRCRv2 — June 9, 2026

As of June 9, 2026, Qwen3.7 Plus leads the MRCRv2 leaderboard with 91.7% , followed by Qwen3.7 Max (90.4%) and Gemini 3.5 Flash (77.3%).

4 modelsReasoning25% of category scoreCurrentUpdated June 9, 2026

According to BenchLM.ai, Qwen3.7 Plus leads the MRCRv2 benchmark with a score of 91.7%, followed by Qwen3.7 Max (90.4%) and Gemini 3.5 Flash (77.3%). There is significant spread across the leaderboard, making this benchmark effective at differentiating model capabilities.

4 models have been evaluated on MRCRv2. The benchmark falls in the Reasoning category. This category carries a 17% weight in BenchLM.ai's overall scoring system. Within that category, MRCRv2 contributes 25% of the category score, so strong performance here directly affects a model's overall ranking.

About MRCRv2

Year

2025

Tasks

Long-context retrieval

Format

Multi-round long-context evaluation

Difficulty

Hard long-context

MRCRv2 is especially useful for models that compete on long context, since it checks whether they can retrieve the right information across long, multi-round interactions.

BenchLM freshness & provenance

Version

MRCRv2 2025

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

Current

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Leaderboard (4 models)

1
91.7%
2
90.4%
3
77.3%
4
43.4%

FAQ

What does MRCRv2 measure?

A long-context benchmark for memory, retrieval, and multi-round coherence over large contexts.

Which model scores highest on MRCRv2?

Qwen3.7 Plus by Alibaba currently leads with a score of 91.7% on MRCRv2.

How many models are evaluated on MRCRv2?

4 AI models have been evaluated on MRCRv2 on BenchLM.

Last updated: June 9, 2026 · BenchLM version MRCRv2 2025

The AI models change fast. We track them for you.

For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.

Free. No spam. Unsubscribe anytime.