Benchmark profile

Kimi Code Bench v2

A Moonshot AI internal coding-agent benchmark for realistic software-engineering tasks across mainstream programming languages and production technology stacks.

Data verified July 27, 2026

Benchmark score on Kimi Code Bench v2 — July 27, 2026

BenchLM mirrors the published score view for Kimi Code Bench v2. Kimi K3 leads the public snapshot at 72.9%, followed by Kimi K2.7 Code (62.0%). BenchLM does not use these results to rank models overall.

1. Kimi K3 (Closed)
Moonshot AI · kimi-3
Score: 72.9%
Overall: 79.89 · Context: 1.05M

2. Kimi K2.7 Code (Open)
Moonshot AI · kimi-k2-7-code
Score: 62.0%
Overall: 54.03 · Context: 256K

2 models · Coding · Current · Display only · Updated July 27, 2026

Benchmark score table (2 models)

Model · Provider · Access · Score
Kimi K3 · Moonshot AI · Closed · 72.9%
Kimi K2.7 Code · Moonshot AI · Open weight · 62.0%
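Because the benchmark reports a pass rate, the two published scores can be compared either in percentage points or as a relative improvement. The sketch below illustrates that arithmetic using the 72.9% and 62.0% figures from this page. The function names are illustrative, and the task counts passed to `pass_rate` are hypothetical, since the page does not state how many tasks the benchmark contains.

```python
def pass_rate(passed: int, total: int) -> float:
    """Return the pass rate as a percentage of tasks solved."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * passed / total


def gap_in_points(leader: float, runner_up: float) -> float:
    """Absolute difference between two pass rates, in percentage points."""
    return round(leader - runner_up, 1)


def relative_gain(leader: float, runner_up: float) -> float:
    """Leader's improvement over the runner-up, as a percentage of the runner-up's score."""
    return round(100.0 * (leader - runner_up) / runner_up, 1)


# Scores as published on this page.
kimi_k3 = 72.9
kimi_k2_7_code = 62.0

points = gap_in_points(kimi_k3, kimi_k2_7_code)    # 10.9 percentage points
relative = relative_gain(kimi_k3, kimi_k2_7_code)  # 17.6 percent

# Hypothetical task counts, only to show how a pass rate is formed.
example_rate = pass_rate(729, 1000)
```

A gap of 10.9 percentage points corresponds to a relative gain of about 17.6% over the lower score; the two framings should not be mixed when quoting the result.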

About Kimi Code Bench v2

Year: 2026
Tasks: Realistic coding-agent tasks
Format: Coding-agent pass rate
Difficulty: Production software engineering

Moonshot describes Kimi Code Bench v2 as an in-house coding-agent benchmark covering backend services, infrastructure, performance engineering, systems programming, security, frontend development, and ML/data engineering. BenchLM stores provider-reported exact values as display-only launch evidence.


BenchLM freshness & provenance

Version: Kimi Code Bench v2 2026
Refresh cadence: Quarterly
Staleness state: Current
Question availability: Public benchmark set

Current · Display only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.
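As a rough illustration of how freshness metadata like the fields above could drive a staleness label, here is a minimal sketch. It is not BenchLM's actual policy, which is described on the methodology page. The `FreshnessRecord` type, the `staleness_state` function, the "Stale" label, and the approximation of a quarterly cadence as 92 days are all assumptions made for this example.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class FreshnessRecord:
    """Hypothetical container for the freshness fields shown on this page."""
    version: str
    refresh_cadence_days: int  # assumption: "Quarterly" approximated as 92 days
    last_verified: date
    display_only: bool


def staleness_state(record: FreshnessRecord, today: date) -> str:
    """Return "Current" while the data is within one refresh cycle, else "Stale".

    The single-cycle rule and the "Stale" label are assumptions for illustration.
    """
    age_days = (today - record.last_verified).days
    return "Current" if age_days <= record.refresh_cadence_days else "Stale"


record = FreshnessRecord(
    version="Kimi Code Bench v2 2026",
    refresh_cadence_days=92,
    last_verified=date(2026, 7, 27),
    display_only=True,
)

state_in_august = staleness_state(record, date(2026, 8, 15))
state_in_december = staleness_state(record, date(2026, 12, 1))
```

Under these assumed rules, the record verified on July 27, 2026 reads as current in mid-August and as stale by December, once more than one quarterly cycle has passed without a refresh.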

FAQ

What does Kimi Code Bench v2 measure?

Kimi Code Bench v2 is a Moonshot AI internal coding-agent benchmark. It measures performance on realistic software-engineering tasks across mainstream programming languages and production technology stacks.

Which model scores highest on Kimi Code Bench v2?

Kimi K3 by Moonshot AI currently leads with a score of 72.9% on Kimi Code Bench v2.

How many models are evaluated on Kimi Code Bench v2?

BenchLM currently lists two AI models evaluated on Kimi Code Bench v2.


Last updated: July 27, 2026 · BenchLM version Kimi Code Bench v2 2026
