Benchmark profile

MKQA-11 multilingual retrieval (MKQA-11)

A display-only multilingual QA retrieval benchmark reported by Liquid AI for LFM2.5 retriever models, using Recall@20 across 11 languages.

Data verified August 2, 202620 confirmed releases in the last 30 daysSee provider release alerts

Benchmark score on MKQA-11 — August 2, 2026

BenchLM mirrors the published score view for MKQA-11. LFM2.5-ColBERT-350M leads the public snapshot at 69.4% , followed by LFM2.5-Embedding-350M (69.1%). BenchLM does not use these results to rank models overall.

1Open

LFM2.5-ColBERT-350M

LiquidAI

lfm2-5-colbert-350m

69.4%

Overall —Context 32K

2Open

LFM2.5-Embedding-350M

LiquidAI

lfm2-5-embedding-350m

69.1%

Overall —Context 32K

2 modelsMultilingualCurrentDisplay onlyUpdated August 2, 2026

Benchmark score table (2 models)

Score

LFM2.5-ColBERT-350MLiquidAI · Open weight

69.4%

LFM2.5-Embedding-350MLiquidAI · Open weight

69.1%

About MKQA-11

Year

2026

Tasks

Cross-lingual open-domain QA retrieval

Format

Recall@20 average

Difficulty

Multilingual retrieval

Liquid reports MKQA-11 average Recall@20 across Arabic, German, English, Spanish, French, Italian, Japanese, Korean, Norwegian, Portuguese, and Swedish. BenchLM stores the average as a display-only retrieval signal.

LFM2.5 Retrievers: Bi-directional LFMs for Fast Multilingual Search

BenchLM freshness & provenance

Version

MKQA-11 2026

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

CurrentDisplay only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

FAQ

What does MKQA-11 measure?

A display-only multilingual QA retrieval benchmark reported by Liquid AI for LFM2.5 retriever models, using Recall@20 across 11 languages.

Which model scores highest on MKQA-11?

LFM2.5-ColBERT-350M by LiquidAI currently leads with a score of 69.4% on MKQA-11.

How many models are evaluated on MKQA-11?

2 AI models have been evaluated on MKQA-11 on BenchLM.

Compare Top Models on MKQA-11

LFM2.5-ColBERT-350M vs LFM2.5-Embedding-350M

Last updated: August 2, 2026 · BenchLM version MKQA-11 2026

Know when it’s worth switching models

The model to choose, the cheaper alternative, and the release we would wait on.

Read a sample issue

Join 2,000+ readers.

One email each week. Unsubscribe anytime.