Name: Llama 4 Maverick
Rating: 17 (17 reviews)
Author: Meta

Question 1

How does Llama 4 Maverick perform overall in AI benchmarks?

Accepted Answer

Llama 4 Maverick currently ranks #115 out of 119 models on BenchLM's provisional leaderboard, with an estimated overall score of 17. It was created by Meta and has a 1M-token context window.

Question 2

Is Llama 4 Maverick good for knowledge and understanding?

Accepted Answer

Llama 4 Maverick ranks #97 out of 119 models in knowledge and understanding benchmarks with an average score of 13.8. There are stronger options in this category.

Question 3

Is Llama 4 Maverick good for coding and programming?

Accepted Answer

Llama 4 Maverick ranks #90 out of 119 models in coding and programming benchmarks with an average score of 8.6. There are stronger options in this category.

Question 4

Is Llama 4 Maverick good for reasoning and logic?

Accepted Answer

Llama 4 Maverick ranks #68 out of 119 models in reasoning and logic benchmarks with an average score of 32.4. There are stronger options in this category.

Question 5

Is Llama 4 Maverick good for agentic tool use and computer tasks?

Accepted Answer

Llama 4 Maverick ranks #94 out of 119 models on benchmarks for agentic tool use and computer tasks, with an average score of 12.6. There are stronger options in this category.

Question 6

Is Llama 4 Maverick good for multimodal and grounded tasks?

Accepted Answer

Llama 4 Maverick ranks #79 out of 119 models on benchmarks for multimodal and grounded tasks, with an average score of 34.2. There are stronger options in this category.

Question 7

Is Llama 4 Maverick good for instruction following?

Accepted Answer

Llama 4 Maverick ranks #106 out of 119 models in instruction following benchmarks with an average score of 22.7. There are stronger options in this category.

Question 8

Is Llama 4 Maverick open source?

Accepted Answer

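Llama 4 Maverick is an open-weight model created by Meta, meaning its weights can be downloaded and run locally or fine-tuned for specific use cases. Open-weight is not the same as open source in the strict sense: use of the weights is subject to Meta's license terms.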

Question 9

Does Llama 4 Maverick have full benchmark coverage on BenchLM?

Accepted Answer

Not yet. Llama 4 Maverick currently has 17 published benchmark scores out of the 225 benchmarks BenchLM tracks. BenchLM only exposes non-generated public benchmark rows, so missing categories stay blank until a sourced evaluation is available.
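To put the coverage figures above in proportion, here is a minimal sketch of the arithmetic. The function name `coverage` is hypothetical and only for illustration; the two numbers come from the answer above.

```python
def coverage(published: int, tracked: int) -> float:
    """Return the share of tracked benchmarks that have a published score."""
    if tracked <= 0:
        raise ValueError("tracked must be positive")
    return published / tracked


# Figures quoted in the answer: 17 published scores out of 225 tracked benchmarks.
share = coverage(17, 225)
print(f"{share:.1%} covered, {225 - 17} benchmarks still blank")
# prints "7.6% covered, 208 benchmarks still blank"
```

With under a tenth of the tracked benchmarks covered, the category averages and the overall score rest on a small sample, which is consistent with the leaderboard being described as provisional.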

Question 10

What is the context window size of Llama 4 Maverick?

Accepted Answer

Llama 4 Maverick has a context window of 1M tokens, which determines how much text (prompt plus generated output) it can process in a single interaction.
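As a rough illustration of what a context window limit means in practice, the sketch below estimates whether a piece of text would fit. It rests on two assumptions that are not from this page: that "1M" means 1,000,000 tokens, and that English text averages about 4 characters per token. The helper names are hypothetical, and a real check would use the model's own tokenizer rather than this heuristic.

```python
CONTEXT_WINDOW_TOKENS = 1_000_000  # assumption: "1M" read as 1,000,000 tokens
CHARS_PER_TOKEN = 4  # assumption: rough heuristic for English, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Estimate the token count from character length, rounding up."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_in_context(text: str, reserved_for_output: int = 0) -> bool:
    """Check whether the estimated prompt size plus reserved output tokens fits."""
    return estimate_tokens(text) + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    prompt = "word " * 100_000  # 500,000 characters
    print(estimate_tokens(prompt))                              # 125000
    print(fits_in_context(prompt, reserved_for_output=4_096))   # True
```

Reserving room for the output matters because the prompt and the generated response share the same window.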
