Skip to main content

KMMLU-Redux

Cleaned KMMLU from national technical qualification exams, with errors removed, decontaminated, and deduplicated.

About KMMLU-Redux

Tasks

~3,500 questions

Format

Technical multiple choice

Difficulty

Industrial/technical

BenchLM freshness & provenance

Version

KMMLU-Redux

Refresh cadence

Static

Staleness state

Refreshing

Question availability

Public benchmark set

RefreshingDisplay only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (0 models)

FAQ

What does KMMLU-Redux measure?

Cleaned KMMLU from national technical qualification exams, with errors removed, decontaminated, and deduplicated.

Which model scores highest on KMMLU-Redux?

No models have been evaluated on KMMLU-Redux yet.

How many models are evaluated on KMMLU-Redux?

0 AI models have been evaluated on KMMLU-Redux on BenchLM.

Last updated: April 16, 2026 · BenchLM version KMMLU-Redux

The AI models change fast. We track them for you.

For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.

Free. No spam. Unsubscribe anytime.