Superseded. xAI has newer models in this line: Grok 4.5

Model profile · xAI

Grok 4.3

Name: Grok 4.3
Author: xAI

Superseded · Released Apr 30, 2026 · Proprietary · Reasoning · 1M context

Grok 4.3 scores 64.2 out of 100 and ranks #36 of 215. This profile shows 2 source-displayable benchmark rows; its strongest (and only) eligible category is Agentic at #129 of 129. API pricing is $1.25 input and $2.50 output per million tokens, with cached input at $0.20.

Data as of July 30, 2026

Strongest published evidence

Agentic ranks #129 of 129. This category is most relevant to coding agents, browser research, and computer-use workflows.

Validate before choosing

2 published rows leave some tracked benchmark slots empty.

Decision snapshot

Each value carries a field reference instead of floating alone. Markers compare this model with the current ranked and priced catalog; they are not absolute quality thresholds.

Capability

64.2/100

field median 57.2

#36 of 215 ranked models

Price

$1.25 input / $2.50 output

input median $1

cached $0.20 · blended $1.88
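
As a rough illustration of how these rates translate into a per-request cost, the sketch below applies the published per-million-token prices to token counts. It assumes that cached input tokens are billed at the cached rate instead of the normal input rate; this profile does not state the billing rule, and the function name `request_cost` is hypothetical.

```python
# Published per-million-token rates from this profile (USD).
INPUT_RATE = 1.25
OUTPUT_RATE = 2.50
CACHED_INPUT_RATE = 0.20


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not confirmed by the profile): cached_tokens is the part of
    input_tokens that is billed at the cached rate rather than the input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_RATE
        + cached_tokens * CACHED_INPUT_RATE
        + output_tokens * OUTPUT_RATE
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 200k input tokens (150k of them cached) and 4k output tokens.
    print(f"${request_cost(200_000, 4_000, cached_tokens=150_000):.4f}")
```

Under this assumption, a long prompt that is mostly cached costs far less than the same prompt sent fresh, which is why the cached rate is worth checking for 1M-context workloads.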

Speed

209 tok/s

field median 107 tok/s

First token 12.36 s
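
The two speed figures can be combined into a back-of-the-envelope response time. The sketch below assumes the 209 tok/s figure is the output rate after the first token arrives, so total time is roughly time to first token plus output tokens divided by throughput. The profile does not define how the two numbers were measured, and `estimate_latency` is a hypothetical helper.

```python
# Speed figures reported on this profile.
FIRST_TOKEN_SECONDS = 12.36
TOKENS_PER_SECOND = 209.0


def estimate_latency(output_tokens: int) -> float:
    """Rough wall-clock estimate in seconds for a response of output_tokens.

    Assumption: throughput applies only after the first token has arrived.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return FIRST_TOKEN_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    for n in (100, 1_000, 10_000):
        print(f"{n:>6} tokens: about {estimate_latency(n):.1f} s")
```

With these numbers the first-token delay dominates short answers, so the high throughput mostly benefits long generations.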

Context

1M tokens

field median 256,000

Reported for this model; direct source link not stored

Capability shape

Each axis shows percentile within that category’s eligible cohort. The comparison outline is the median of the six nearest public-score peers; a collapsed vertex means the category is not rank-eligible.

Agentic: 0th percentile
Coding: Not eligible
Reasoning: Not eligible
Knowledge: Not eligible
Math: Not eligible
Multilingual: Not eligible
Multimodal: Not eligible
Instruction following: Not eligible

The dashed outline is the median of the 6 nearest peers.

Legend: Top decile · Top quartile · Mid-field · Not eligible
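
To make the percentile axis concrete, here is a minimal sketch of one common way to turn a rank within a cohort into a percentile: the share of the cohort ranked below the model. This formula is an assumption, since the profile does not publish its exact definition. It does reproduce the one data point shown here, where rank #129 of 129 maps to the 0th percentile.

```python
def rank_to_percentile(rank: int, cohort_size: int) -> float:
    """Percentage of the cohort that ranks below the given model.

    Assumed formula: 100 * (cohort_size - rank) / cohort_size.
    Rank 1 is the best position in the cohort.
    """
    if not 1 <= rank <= cohort_size:
        raise ValueError("rank must be between 1 and cohort_size")
    return 100.0 * (cohort_size - rank) / cohort_size


if __name__ == "__main__":
    # Agentic category on this profile: #129 of 129 eligible models.
    print(rank_to_percentile(129, 129))
```

Other definitions exist, for example dividing by the cohort size minus one so that rank 1 maps to exactly 100. They agree for the last-place case shown on this page.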

What it costs to get this score

Published API price against the public score. The x-axis uses a log scale; the dashed path marks models that are not beaten by a cheaper, higher-scoring option. Price uses average of published input and output rates.
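
The chart logic described above can be sketched in a few lines. The blended price is the average of the input and output rates, and the dashed path corresponds to a Pareto frontier: models for which no cheaper option has a higher score. The two Grok entries use the prices and scores shown on this page; the other two models are hypothetical and exist only to exercise the code, and the function names are illustrative rather than taken from the site.

```python
from typing import List, NamedTuple


class Model(NamedTuple):
    name: str
    input_price: float   # USD per million input tokens
    output_price: float  # USD per million output tokens
    score: float         # public score out of 100


def blended_price(m: Model) -> float:
    """Average of the published input and output rates."""
    return (m.input_price + m.output_price) / 2


def pareto_frontier(models: List[Model]) -> List[Model]:
    """Models not beaten by a cheaper, higher-scoring option, cheapest first.

    Ties in price are resolved in favor of the higher score.
    """
    frontier: List[Model] = []
    best_score = float("-inf")
    for m in sorted(models, key=lambda m: (blended_price(m), -m.score)):
        if m.score > best_score:
            frontier.append(m)
            best_score = m.score
    return frontier


CATALOG = [
    Model("Grok 4.3", 1.25, 2.50, 64.2),        # from this profile
    Model("Grok 4.5", 2.00, 6.00, 75.5),        # from the lineage section
    Model("hypothetical-a", 0.50, 1.50, 50.0),  # made-up example row
    Model("hypothetical-b", 3.00, 9.00, 70.0),  # made-up example row
]

if __name__ == "__main__":
    for m in pareto_frontier(CATALOG):
        print(f"{m.name}: ${blended_price(m):.3f} blended, score {m.score}")
```

In this toy catalog, hypothetical-b drops off the frontier because Grok 4.5 is both cheaper and higher-scoring. Grok 4.3's blended price works out to $1.875, which the page displays as $1.88.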

Current model: Grok 4.3 · 64.2 score · $1.88 blended per million tokens

Horizontal: blended price per million tokens, log scale · Vertical: public score

How much of this is verified

Coverage is split by category so a strong number never hides a thin evidence base. Verified means the row is tied to a published source; provisional rows remain visible but separate.

Agentic: 2/2 verified
Coding: Not measured
Reasoning: Not measured
Knowledge: Not measured
Math: Not measured
Multilingual: Not measured
Multimodal: Not measured
Inst. Following: Not measured

Legend: Verified source · Provisional · Not measured

Spec sheet

Each documented value carries its source. Missing fields stay visible as not sourced or not published, rather than disappearing from the page.

API model ID: Not published
Context window: 1M
Maximum output: Not sourced yet
Knowledge cutoff: Not sourced yet
Input modalities: Not sourced yet
Output modalities: Not sourced yet
Parameters: Not disclosed by the provider

Availability: Not sourced yet
Cloud regions: Not tracked yet
Lifecycle: Superseded
API capabilities: Tool calling, structured outputs, and batch support are not tracked yet
Prompt caching: Published at $0.20 per million cached input tokens
Self-host: Weights are not published
Rate limits: Not tracked yet

Category score record

Scores and ranks appear only where published evidence can be displayed. The table keeps the score, weight, cohort, and evidence state together.

Category scores, ranks, weighting, benchmark coverage, and evidence status
Category	Score	Rank	Percentile	Weight	Benchmarks	Evidence
Agentic	6.2	#129 of 129	0th	22%	2 benchmarks	Verified
Coding	Not measured	Not ranked	Not available	20%	0 benchmarks	Not measured
Reasoning	Not measured	Not ranked	Not available	17%	0 benchmarks	Not measured
Knowledge	Not measured	Not ranked	Not available	12%	0 benchmarks	Not measured
Math	Not measured	Not ranked	Not available	5%	0 benchmarks	Not measured
Multilingual	Not measured	Not ranked	Not available	7%	0 benchmarks	Not measured
Multimodal	Not measured	Not ranked	Not available	12%	0 benchmarks	Not measured
Inst. Following	Not measured	Not ranked	Not available	5%	0 benchmarks	Not measured
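
The category record above can be held in a small data structure to sanity-check the weights and to see how much of the weighting has evidence behind it. This is a sketch only: the class and variable names are hypothetical, and it deliberately does not try to reproduce the overall 64.2 score, because the profile does not describe that calculation here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CategoryRecord:
    name: str
    weight_pct: int
    score: Optional[float] = None  # None means "Not measured"


# Values copied from the category score record on this profile.
RECORDS = [
    CategoryRecord("Agentic", 22, 6.2),
    CategoryRecord("Coding", 20),
    CategoryRecord("Reasoning", 17),
    CategoryRecord("Knowledge", 12),
    CategoryRecord("Math", 5),
    CategoryRecord("Multilingual", 7),
    CategoryRecord("Multimodal", 12),
    CategoryRecord("Inst. Following", 5),
]

# Sum of all category weights; should cover the whole score.
total_weight = sum(r.weight_pct for r in RECORDS)

# Weight carried by categories that have a published score.
measured_weight = sum(r.weight_pct for r in RECORDS if r.score is not None)

if __name__ == "__main__":
    print(f"total weight: {total_weight}%")
    print(f"weight with measured scores: {measured_weight}%")
```

The weights sum to 100%, and only the 22% Agentic share has a measured score behind it on this profile.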

Benchmark ledger

Agentic opens by default. The marker compares each value with the best source-verified result in the catalog; provisional leaders do not set the reference. Expand the remaining categories for every published row.

Agentic: 2 rows

Agentic benchmark values, best verified comparison, weight, and source status
Benchmark	Score	Versus best verified row	Gap	Weight	Evidence
Gert Labs Composite Game Benchmark	43.86%	Best verified: Claude Opus 4.8 · 72.97%	29.1 behind	Display only	Benchmark exact · Gert Labs rankings
ResearchClawBench	12.4%	Best verified: Claude Opus 4.8 · 21.1%	8.7 behind	Display only	Benchmark exact · ResearchClawBench leaderboard
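
The Gap column is consistent with a plain difference in percentage points between the best verified result and this model's score, rounded to one decimal place. The sketch below checks that reading against both rows; the rounding rule is inferred from the displayed values, and `gap_behind` is a hypothetical helper name.

```python
def gap_behind(score_pct: float, best_verified_pct: float) -> float:
    """Percentage-point gap to the best verified row, rounded to one decimal."""
    return round(best_verified_pct - score_pct, 1)


# (benchmark, Grok 4.3 score, best verified score) as shown in the ledger.
ROWS = [
    ("Gert Labs Composite Game Benchmark", 43.86, 72.97),
    ("ResearchClawBench", 12.4, 21.1),
]

if __name__ == "__main__":
    for name, score, best in ROWS:
        print(f"{name}: {gap_behind(score, best)} behind")
```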

Lineage

The sequence follows explicit supersedes links. Scores and prices remain blank when the corresponding public row or first-party rate is unavailable.

Apr 30, 2026 · you are here

Grok 4.3

Score 64.2 · $1.25 / $2.50

Jul 8, 2026

Grok 4.5

Score 75.5 · $2 / $6

Base entry

How to read this profile

The visual layer above carries the decisions. These notes preserve the model, ranking, coverage, and family context behind the numbers.

Grok 4.3 ranks #36 of 215 on the public leaderboard with a score of 64.2/100. Its source-verified position is #31 of 103.

Grok 4.3 is a proprietary model with a 1M context window. It uses an explicit reasoning mode, which can improve complex problem solving while adding latency and token use.

2 of 371 tracked benchmark slots currently have displayable evidence. Missing categories stay blank.

Its strongest eligible category is Agentic at #129 of 129. This category is most relevant to coding agents, browser research, and computer-use workflows.

Frequently asked questions

How does Grok 4.3 perform overall in AI benchmarks?

Grok 4.3 ranks #36 out of 215 models on the public BenchLM leaderboard, with a score of 64.2/100. Its evidence status is Supported, and this profile shows 2 source-displayable benchmark rows. The label describes evidence depth, not a provider quality claim; inspect category rows before choosing a workload.

Is Grok 4.3 good for agentic tool use and computer tasks?

Grok 4.3 ranks #129 out of 129 eligible models for agentic tool use and computer tasks, with a public category score of 6.2/100. Higher-ranked alternatives are available for workloads where this category decides the choice. Check the underlying rows before treating the aggregate as a workload guarantee.

Does Grok 4.3 have full benchmark coverage on BenchLM?

No. Grok 4.3 currently has 2 source-displayable rows across 371 tracked benchmark slots. The profile exposes published, non-generated evidence and leaves missing categories blank until an exact evaluation is available. Coverage describes how much was measured; it is not a penalty added to an individual benchmark result.

What is the context window size of Grok 4.3?

Grok 4.3 has a reported context window of 1M in the exact-model catalog record. The value stays visible, but the profile marks its source link as unavailable instead of presenting it as directly documented. Maximum output length remains separate because providers often publish a different limit.

Last updated July 30, 2026. Runtime fields remain blank until a sourced snapshot exists.
