Benchmark profile

GDPval rubrics

A display-only provider-table GDPval rubric score for economically valuable work tasks.

Data verified July 27, 2026

Benchmark score on GDPval rubrics — July 27, 2026

BenchLM mirrors the published score view for GDPval rubrics. MiniMax M3 leads the public snapshot at 74.7%. BenchLM does not use these results to rank models overall.

1Open

MiniMax M3

MiniMax

minimax-m3

74.7%

Overall 68.8Context 1M

1 modelAgenticCurrentDisplay onlyUpdated July 27, 2026

Benchmark score table (1 model)

Score

MiniMax M3MiniMax · Open weight

74.7%

About GDPval rubrics

Year

2026

Tasks

Economically valuable work tasks

Format

Rubric score

Difficulty

Professional agentic workflows

MiniMax reports GDPval rubrics as a percentage-style provider benchmark. BenchLM stores it separately from AA GDPval Elo and normalized GDPval rows.

MiniMax M3 model card

BenchLM freshness & provenance

Version

GDPval rubrics 2026

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

CurrentDisplay only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

FAQ

What does GDPval rubrics measure?

A display-only provider-table GDPval rubric score for economically valuable work tasks.

Which model scores highest on GDPval rubrics?

MiniMax M3 by MiniMax currently leads with a score of 74.7% on GDPval rubrics.

How many models are evaluated on GDPval rubrics?

1 AI models have been evaluated on GDPval rubrics on BenchLM.

Last updated: July 27, 2026 · BenchLM version GDPval rubrics 2026

Know when it’s worth switching models

The model to choose, the cheaper alternative, and the release we would wait on.

One email each week. Unsubscribe anytime.