Kimi Code Bench v2

A Moonshot AI internal coding-agent benchmark for realistic software-engineering tasks across mainstream programming languages and production technology stacks.

Benchmark score on Kimi Code Bench v2 — June 12, 2026

BenchLM mirrors the published score view for Kimi Code Bench v2. Kimi K2.7 Code leads the public snapshot at 62.0%. BenchLM does not use these results to rank models overall.

1 model · Coding · Current · Display only · Updated June 12, 2026

About Kimi Code Bench v2

Year: 2026
Tasks: Realistic coding-agent tasks
Format: Coding-agent pass rate
Difficulty: Production software engineering

Moonshot describes Kimi Code Bench v2 as an in-house coding-agent benchmark covering backend services, infrastructure, performance engineering, systems programming, security, frontend development, and ML/data engineering. BenchLM stores provider-reported exact values as display-only launch evidence.
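The page lists the format as a coding-agent pass rate but does not define how it is computed. A common reading is the share of tasks the agent completes successfully. The sketch below illustrates only that assumed reading: the `pass_rate` function and the five task outcomes are hypothetical and are not taken from Moonshot AI or BenchLM.

```python
from typing import Sequence


def pass_rate(results: Sequence[bool]) -> float:
    """Return the percentage of tasks marked as passed, rounded to one decimal."""
    if not results:
        raise ValueError("results must contain at least one task outcome")
    # True counts as 1 and False as 0, so sum() gives the number of passes.
    return round(100.0 * sum(results) / len(results), 1)


# Hypothetical outcomes for illustration only: 3 passes out of 5 tasks.
outcomes = [True, True, True, False, False]
print(f"{pass_rate(outcomes):.1f}%")  # prints "60.0%"
```

Under this reading, a reported score such as 62.0% would mean that roughly 62 of every 100 tasks were solved, although the actual grading rules for Kimi Code Bench v2 are not given here.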

BenchLM freshness & provenance

Version: Kimi Code Bench v2 2026
Refresh cadence: Quarterly
Staleness state: Current
Question availability: Public benchmark set

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (1 model)

1. Kimi K2.7 Code · 62.0%

FAQ

What does Kimi Code Bench v2 measure?

Kimi Code Bench v2 is a Moonshot AI internal coding-agent benchmark. It measures performance on realistic software-engineering tasks across mainstream programming languages and production technology stacks.

Which model scores highest on Kimi Code Bench v2?

Kimi K2.7 Code by Moonshot AI currently leads with a score of 62.0% on Kimi Code Bench v2.

How many models are evaluated on Kimi Code Bench v2?

One AI model has been evaluated on Kimi Code Bench v2 on BenchLM.

Last updated: June 12, 2026 · BenchLM version Kimi Code Bench v2 2026
