Skip to main content

VoxPopuli-Cleaned-AA Word Error Rate (VoxPopuli WER)

A speech-recognition benchmark on the cleaned Artificial Analysis VoxPopuli subset, reported as word error rate where lower is better.

Benchmark score on VoxPopuli WER — May 12, 2026

BenchLM mirrors the published score view for VoxPopuli WER. Interfaze Beta leads the public snapshot at 2.4%. BenchLM does not use these results to rank models overall.

1 modelsMultimodal & GroundedCurrentDisplay onlyUpdated May 12, 2026

About VoxPopuli WER

Year

2026

Tasks

Speech-to-text transcription

Format

Word error rate

Difficulty

Audio speech recognition

VoxPopuli-Cleaned-AA measures transcription quality on multilingual European Parliament speech using Whisper-style text normalization. BenchLM stores Interfaze's WER as a display-only multimodal and audio row.

BenchLM freshness & provenance

Version

VoxPopuli WER 2026

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

CurrentDisplay only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (1 models)

1
2.4%

FAQ

What does VoxPopuli WER measure?

A speech-recognition benchmark on the cleaned Artificial Analysis VoxPopuli subset, reported as word error rate where lower is better.

Which model scores highest on VoxPopuli WER?

Interfaze Beta by Interfaze currently leads with a score of 2.4% on VoxPopuli WER.

How many models are evaluated on VoxPopuli WER?

1 AI models have been evaluated on VoxPopuli WER on BenchLM.

Last updated: May 12, 2026 · BenchLM version VoxPopuli WER 2026

The AI models change fast. We track them for you.

For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.

Free. No spam. Unsubscribe anytime.