MiniMax M2.7 Benchmark Scores & Performance

BenchLM is tracking MiniMax M2.7 by MiniMax. Some benchmark data is visible, but not enough non-generated coverage is available for a leaderboard rank yet.

BenchLM is tracking MiniMax M2.7, but this profile is currently excluded from the public leaderboard because it still lacks enough trustworthy benchmark coverage to rank safely. Generated rows may still appear below for context, but they are not enough on their own to make this model ranking-eligible.

MiniMax M2.7 is a proprietary model with a 200K token context window. It processes queries without explicit chain-of-thought reasoning, offering faster response times and lower token usage.

BenchLM links it directly to MiniMax M2.5 as the earlier related model in that lineage. This profile currently has 11 of 51 tracked benchmarks. BenchLM uses discounted fallback estimates where necessary so missing categories do not collapse the overall score, but public sourced rows still carry more weight than inferred ones.

Its strongest category is Agentic (#20), while its weakest is Coding (#22). This performance profile makes it particularly useful for coding agents, browser research, and computer-use workflows.

Creator

MiniMax

Source Type

Proprietary

Reasoning

Non-Reasoning

Context Window

200K

Overall Score

Coming soon

Family & Lineage

Family

MiniMax M2.7

Base entry

Related Earlier Model

MiniMax M2.5

Rankings Overview

BenchLM is still missing enough trustworthy benchmark coverage to rank this model across the public leaderboard. Trusted benchmark rows remain visible below for reference.

Knowledge Benchmarks

Artificial Analysis
50

Coding Benchmarks

SWE-bench Pro
56.2%
SWE Multilingual
76.5%
Multi-SWE Bench
52.7%
VIBE-Pro
55.6%
NL2Repo
39.8%

Agentic Benchmarks

Terminal-Bench 2.0
57%
Toolathlon
46.3%
MLE-Bench Lite
66.6%
MM-ClawBench
62.7%

Multimodal & Grounded Benchmarks

GDPval-AA
1495

Frequently Asked Questions

How does MiniMax M2.7 perform overall in AI benchmarks?

MiniMax M2.7 ranks #null out of 58 models with an overall score of 57 (estimated). It is created by MiniMax and features a 200K context window.

Is MiniMax M2.7 good for knowledge and understanding?

MiniMax M2.7 ranks #null out of 58 models in knowledge and understanding benchmarks with an average score of 0. It is among the top performers in this category.

Is MiniMax M2.7 good for coding and programming?

MiniMax M2.7 ranks #22 out of 58 models in coding and programming benchmarks with an average score of 56.2. There are stronger options in this category.

Is MiniMax M2.7 good for agentic tool use and computer tasks?

MiniMax M2.7 ranks #20 out of 58 models in agentic tool use and computer tasks benchmarks with an average score of 57. There are stronger options in this category.

Is MiniMax M2.7 good for multimodal and grounded tasks?

MiniMax M2.7 ranks #null out of 58 models in multimodal and grounded tasks benchmarks with an average score of 0. It is among the top performers in this category.

Does MiniMax M2.7 have full benchmark coverage on BenchLM?

Not yet. MiniMax M2.7 currently has 11 sourced benchmark scores out of the 51 benchmarks BenchLM tracks. BenchLM may use discounted fallback values for missing categories, but trustworthy public rows still carry more weight than inferred ones.

What is the context window size of MiniMax M2.7?

MiniMax M2.7 has a context window of 200K, which determines how much text it can process in a single interaction.

Last updated: March 18, 2026

Weekly LLM Updates

New model releases, benchmark scores, and leaderboard changes. Every Friday.

Free. Your signup is stored with a derived country code for compliance routing.