An evaluation focused on professional domain expertise and task delivery quality in office-style knowledge work.
As of March 2026, MiniMax M2.7 leads the GDPval-AA leaderboard with a score of 1495.
Year: 2026
Tasks: Professional office delivery
Format: ELO-style office benchmark
Difficulty: Professional knowledge work
MiniMax describes GDPval-AA as an office-domain evaluation for professional expertise and delivery quality. BenchLM stores the published ELO-style score as a display-only benchmark reference.
MiniMax M2.7: Early Echoes of Self-Evolution
MiniMax M2.7 by MiniMax currently leads with a score of 1495 on GDPval-AA.
One AI model has been evaluated on GDPval-AA on BenchLM.