Benchmark profile

VIBE V2

A display-only MiniMax provider benchmark for end-to-end coding-agent and product-building tasks.

Data verified July 27, 2026

Benchmark score on VIBE V2 — July 27, 2026

BenchLM mirrors the published score view for VIBE V2. MiniMax M3 leads the public snapshot at 50.1%. BenchLM does not use these results to rank models overall.

Rank 1 · Open weight · MiniMax M3 · MiniMax · minimax-m3 · 50.1% · Overall 68.8 · Context 1M

1 model · Coding · Current · Display only · Updated July 27, 2026

Benchmark score table (1 model)

Model                                Score
MiniMax M3 (MiniMax · Open weight)   50.1%

About VIBE V2

Year: 2026
Tasks: End-to-end coding-agent tasks
Format: Task success rate
Difficulty: Frontier coding-agent workflows

MiniMax reports VIBE V2 in the M3 comparison chart. BenchLM tracks it as a display-only provider row because it is not part of the weighted coding schema.
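
As a rough illustration of what "display-only" means for scoring, the sketch below computes a weighted category score that skips display-only rows. It is not BenchLM's actual implementation: the `BenchmarkRow` type, the `weighted_category_score` function, the two `hypothetical-bench-*` rows, and all weights are assumptions made up for this example. Only the VIBE V2 score of 50.1% comes from this page.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class BenchmarkRow:
    """One benchmark result for a model (hypothetical structure)."""
    name: str
    score: float        # percentage, 0-100
    weight: float       # assumed weight in the category schema
    display_only: bool  # True for provider rows such as VIBE V2


def weighted_category_score(rows: Sequence[BenchmarkRow]) -> Optional[float]:
    """Weighted mean over rows that are part of the schema.

    Display-only rows are ignored regardless of their weight.
    Returns None when no weighted row is available.
    """
    scored = [r for r in rows if not r.display_only]
    total_weight = sum(r.weight for r in scored)
    if total_weight == 0:
        return None
    return sum(r.score * r.weight for r in scored) / total_weight


# Illustrative data: the first two rows and their weights are invented.
rows = [
    BenchmarkRow("hypothetical-bench-a", 60.0, 3.0, False),
    BenchmarkRow("hypothetical-bench-b", 40.0, 1.0, False),
    BenchmarkRow("VIBE V2", 50.1, 0.0, True),  # shown on the page, not scored
]

print(weighted_category_score(rows))
```

Under these assumptions, adding or removing the VIBE V2 row leaves the category score unchanged, which is the behavior the display-only label describes.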

MiniMax M3 model card

BenchLM freshness & provenance

Version: VIBE V2 2026
Refresh cadence: Quarterly
Staleness state: Current
Question availability: Public benchmark set

Current · Display only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

FAQ

What does VIBE V2 measure?

VIBE V2 measures task success rate on end-to-end coding-agent and product-building tasks. It is a MiniMax provider benchmark that BenchLM shows as display-only.

Which model scores highest on VIBE V2?

MiniMax M3 by MiniMax currently leads with a score of 50.1% on VIBE V2.

How many models are evaluated on VIBE V2?

One AI model has been evaluated on VIBE V2 on BenchLM.

Last updated: July 27, 2026 · BenchLM version VIBE V2 2026
