GDPval-AA (GDPval-AA)

An evaluation focused on professional domain expertise and task delivery quality in office-style knowledge work.

Top Models on GDPval-AA — March 2026

As of March 2026, MiniMax M2.7 leads the GDPval-AA leaderboard with 1495.

1 modelsMultimodal & GroundedUpdated March 18, 2026

About GDPval-AA

Year

2026

Tasks

Professional office delivery

Format

ELO-style office benchmark

Difficulty

Professional knowledge work

MiniMax describes GDPval-AA as an office-domain evaluation for professional expertise and delivery quality. BenchLM stores the published ELO-style score as a display-only benchmark reference.

MiniMax M2.7: Early Echoes of Self-Evolution

Leaderboard (1 models)

#1MiniMax M2.7
1495

FAQ

What does GDPval-AA measure?

An evaluation focused on professional domain expertise and task delivery quality in office-style knowledge work.

Which model scores highest on GDPval-AA?

MiniMax M2.7 by MiniMax currently leads with a score of 1495 on GDPval-AA.

How many models are evaluated on GDPval-AA?

1 AI models have been evaluated on GDPval-AA on BenchLM.

Last updated: March 18, 2026

Weekly LLM Benchmark Digest

Get notified when new models drop, benchmark scores change, or the leaderboard shifts. One email per week.

Free. No spam. Unsubscribe anytime. We only store derived location metadata for consent routing.