Benchmark analysis of Llama 4 Behemoth by Meta across 14 tests.
Creator
Meta
Source Type
Open Weight
Reasoning
Non-Reasoning
Context Window
32K
Overall Score
Llama 4 Behemoth ranks #69 out of 88 models with an overall score of 39. It was created by Meta and has a 32K-token context window.
Llama 4 Behemoth ranks #69 out of 88 models in knowledge and understanding benchmarks with an average score of 45.8. There are stronger options in this category.
Llama 4 Behemoth ranks #69 out of 88 models in coding and programming benchmarks with an average score of 40. There are stronger options in this category.
Llama 4 Behemoth ranks #69 out of 88 models in mathematics benchmarks with an average score of 47. There are stronger options in this category.
Llama 4 Behemoth ranks #69 out of 88 models in reasoning and logic benchmarks with an average score of 45. There are stronger options in this category.
Llama 4 Behemoth is an open weight model created by Meta, meaning it can be downloaded and run locally or fine-tuned for specific use cases.
Llama 4 Behemoth has a context window of 32K tokens, which determines how much text it can process in a single interaction.
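As a rough illustration of what a 32K-token limit means in practice, the sketch below estimates whether a prompt fits in the window. It rests on two assumptions that do not come from the benchmark data: that "32K" means 32,768 tokens, and that English text averages about 4 characters per token. The function names and the reserved output budget are hypothetical. An accurate count would need the model's own tokenizer.

```python
# Assumption: "32K" is read as 32 * 1024 tokens; the listing does not say.
CONTEXT_WINDOW_TOKENS = 32 * 1024

# Assumption: a common rule of thumb for English text, not the model's tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Roughly estimate the token count from character length (rounded up)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(prompt: str, reserved_for_output: int = 1024) -> bool:
    """Return True if the estimated prompt size leaves room for the reply."""
    return estimate_tokens(prompt) + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    short_prompt = "Summarize this paragraph." * 10
    long_prompt = "x" * 200_000  # far larger than the window under this heuristic
    print(fits_in_context(short_prompt))
    print(fits_in_context(long_prompt))
```

Because the prompt and the generated reply share the same window, the sketch reserves part of the budget for output instead of comparing the prompt against the full limit.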