Video-MME (with subtitle)

A video understanding benchmark that allows subtitle access when answering multimodal questions about videos.

Benchmark score on Video-MME (with subtitle) — July 4, 2026

BenchLM mirrors the published score view for Video-MME (with subtitle). Qwen3.7 Plus leads the public snapshot at 88.0%, followed by Qwen3.6-27B (87.7%) and MiMo-V2.5 (87.7%). BenchLM does not use these results to rank models overall.

5 models · Multimodal & Grounded · Current · Display only · Updated July 4, 2026

The published Video-MME (with subtitle) snapshot is tightly clustered at the top: Qwen3.7 Plus sits at 88.0%, while the third row is only 0.3 points behind at 87.7%. The spread across all five published scores is 2.6 points (88.0% down to 85.4%), so the whole table sits in a relatively narrow band.

Five models have been evaluated on Video-MME (with subtitle). The benchmark falls in the Multimodal & Grounded category, which carries a 12% weight in BenchLM.ai's overall scoring system. Video-MME (with subtitle) is currently displayed for reference only and is excluded from the scoring formula, so it does not directly affect overall rankings.
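The "display only" rule can be pictured with a small sketch. BenchLM's actual scoring formula is not described on this page, so everything below is an assumption made for illustration: the `BenchmarkResult` type, the plain mean inside a category, and the second benchmark entry (a made-up placeholder, not a real result). Only the 12% category weight and the 88.0% score come from the page.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class BenchmarkResult:
    name: str
    score: float        # percent, 0-100
    display_only: bool  # True: shown for reference, not scored


def category_contribution(
    results: List[BenchmarkResult], category_weight: float = 0.12
) -> Optional[float]:
    """Weighted contribution of one category to an overall score.

    Assumed aggregation (hypothetical): a plain mean over the benchmarks
    that are not display-only, multiplied by the category weight.
    Returns None when no benchmark in the category is scored.
    """
    scored = [r.score for r in results if not r.display_only]
    if not scored:
        return None
    return category_weight * sum(scored) / len(scored)


# "placeholder-benchmark" and its 70.0 are invented purely for illustration.
with_high = [
    BenchmarkResult("Video-MME (with subtitle)", 88.0, display_only=True),
    BenchmarkResult("placeholder-benchmark", 70.0, display_only=False),
]
with_low = [
    BenchmarkResult("Video-MME (with subtitle)", 50.0, display_only=True),
    BenchmarkResult("placeholder-benchmark", 70.0, display_only=False),
]

# Changing the display-only score leaves the contribution unchanged.
same = category_contribution(with_high) == category_contribution(with_low)
print("Display-only score affects contribution:", not same)
```

Under these assumptions, filtering happens before any averaging, which is one simple way a benchmark can stay visible on a page without moving the overall ranking.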

About Video-MME (with subtitle)

Year

2026

Tasks

Video understanding

Format

Video QA with subtitle context

Difficulty

Multimodal video reasoning

The subtitle-enabled Video-MME setting measures how well a model combines video perception with textual cues from subtitles rather than relying on frames alone.

BenchLM freshness & provenance

Version

Video-MME (with subtitle) 2026

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

Current · Display only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (5 models)

Rank  Score
1     88.0%
2     87.7%
3     87.7%
4     86.6%
5     85.4%
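The gap figures quoted for this snapshot can be checked directly from the five scores in the table. The snippet below is a minimal sketch of that arithmetic; the `gap` helper is a name introduced here for illustration, and rounding to one decimal place is an assumption that matches the precision of the published scores.

```python
# Scores copied from the table above (percent), in rank order.
scores = [88.0, 87.7, 87.7, 86.6, 85.4]


def gap(a: float, b: float) -> float:
    """Difference in percentage points, rounded to one decimal
    so binary floating-point noise does not leak into the result."""
    return round(a - b, 1)


lead_over_third = gap(scores[0], scores[2])   # first row vs. third row
full_spread = gap(max(scores), min(scores))   # best vs. worst published score

print(f"Lead over third row: {lead_over_third} points")
print(f"Spread across all {len(scores)} rows: {full_spread} points")
```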

FAQ

What does Video-MME (with subtitle) measure?

It measures video understanding in a setting where the model may use subtitles when answering multimodal questions about videos.

Which model scores highest on Video-MME (with subtitle)?

Qwen3.7 Plus by Alibaba currently leads with a score of 88.0% on Video-MME (with subtitle).

How many models are evaluated on Video-MME (with subtitle)?

Five AI models have been evaluated on Video-MME (with subtitle) on BenchLM.

Last updated: July 4, 2026 · BenchLM version Video-MME (with subtitle) 2026