React Native Evals

Name: React Native Evals
Creator: BenchLM

An open benchmark for AI coding agents on real-world React Native implementation tasks, emphasizing working app behavior, recommended architecture choices, and strict constraint adherence.

Benchmark score on React Native Evals — May 7, 2026

BenchLM mirrors the published score view for React Native Evals. Composer 2 leads the public snapshot at 96.1% , followed by Composer 2 Fast (94.9%) and GPT-5.4 (85.3%). BenchLM does not use these results to rank models overall.

1Closed

Composer 2

Cursor

96.1%

Overall —Context 200K

2Closed

Composer 2 Fast

Cursor

94.9%

Overall —Context 200K

3Closed

GPT-5.4

OpenAI

85.3%

Overall 89Context 1.05M

16 modelsCodingCurrentDisplay onlyUpdated May 7, 2026

The published React Native Evals snapshot is tightly clustered at the top: Composer 2 sits at 96.1%, while the third row is only 10.8 points behind. The broader top-10 spread is 20.9 points, so the benchmark still separates strong models even when the leaders cluster.

16 models have been evaluated on React Native Evals. The benchmark falls in the Coding category. This category carries a 20% weight in BenchLM.ai's overall scoring system. React Native Evals is currently displayed for reference but excluded from the scoring formula, so it does not directly affect overall rankings.

About React Native Evals

Year

2026

Tasks

React Native app implementation tasks

Format

Framework-specific app development evaluation

Difficulty

Production mobile app engineering

React Native Evals focuses on framework-specific mobile work that generic coding benchmarks often miss. The public dashboard groups tasks into areas like navigation, animation, and async state, with repeated runs and cost tracking across models.

React Native Evals

BenchLM freshness & provenance

Version

React Native Evals 2026

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

CurrentDisplay only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (16 models)

Composer 2

CursorClosed

96.1%

Composer 2 Fast

CursorClosed

94.9%

GPT-5.4

OpenAIClosed

85.3%

GPT-5.5

OpenAIClosed

84.7%

Claude Opus 4.6

AnthropicClosed

84.1%

Claude Opus 4.7

AnthropicClosed

82.8%

Claude Sonnet 4.6

AnthropicClosed

80.6%

Gemini 3.1 Pro

GoogleClosed

78.9%

Kimi K2.5

Moonshot AIOpen

77.2%

Gemma 4 31B

GoogleOpen

75.2%

GLM-5

Z.AIOpen

74.8%

Grok 4

xAIClosed

72.6%

GPT-OSS 120B

OpenAIOpen

71.6%

DeepSeek V3.2

DeepSeekOpen

71.5%

MiniMax M2.7

MiniMaxOpen

71.4%

GPT-OSS 20B

OpenAIOpen

71%

FAQ

What does React Native Evals measure?

An open benchmark for AI coding agents on real-world React Native implementation tasks, emphasizing working app behavior, recommended architecture choices, and strict constraint adherence.

Which model scores highest on React Native Evals?

Composer 2 by Cursor currently leads with a score of 96.1% on React Native Evals.

How many models are evaluated on React Native Evals?

16 AI models have been evaluated on React Native Evals on BenchLM.

Compare Top Models on React Native Evals

Composer 2 vs Composer 2 Fast Composer 2 Fast vs GPT-5.4 GPT-5.4 vs GPT-5.5 GPT-5.5 vs Claude Opus 4.6

Learn More

Read our explainer: React Native Evals benchmark deep dive

Last updated: May 7, 2026 · BenchLM version React Native Evals 2026

The AI models change fast. We track them for you.

For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.

Free. No spam. Unsubscribe anytime.