https://www.techtarget.com/whatis/feature/Meta-Llama-4-explained-Everything-you-need-to-know
When Meta, formerly Facebook, launched its large language model (LLM) Llama in February 2023, it was originally spelled LLaMA. The acronym stands for Large Language Model Meta AI.
Since Llama 2's release in July 2023, Meta has provided the model under an open permissive license, easing organizational access and use. Its multiple iterations have expanded Llama's capabilities and improved its standing among rivals, including models from OpenAI, Anthropic and Google.
On April 5, 2025, Meta released the Llama 4 model family, the Llama 4 herd.
Meta Llama 4 is a multimodal LLM that analyzes and understands text, images and video data. This fourth-generation model also supports multiple languages from all parts of the globe.
The Llama 4 models are the first LLMs in the Llama family to employ a mixture-of-experts architecture: Only a subset of the total parameters activate for an input token. This approach targets a balance of power with efficiency.
The Llama 4 community license is not an official Open Source Initiative-approved license, but Meta refers to its Llama 4 models as open source. The Meta Llama license provides free usage and modification of the Llama 4 models with certain limits. As of April 2025, the limit was 700 million monthly users. After that point, a commercial license is required.
The three primary versions of Llama 4 are Scout, Maverick and Behemoth. The Scout and Maverick models were available at launch, with the Behemoth model still in training. This chart compares them.
| Feature | Llama 4 Scout | Llama 4 Maverick | Llama 4 Behemoth |
| Active parameters | 17 billion | 17 billion | 288 billion |
| Number of experts | 16 | 128 | 16 |
| Total parameters | 109 billion | 400 billion | 2 trillion |
| Context window | 10 million tokens | 1 million tokens | Not specified |
| Knowledge cutoff | August 2024 | August 2024 | Not specified |
| Release date | April 5, 2025 | April 5, 2025 | Not yet released |
The Meta Llama 4 models are applicable across a wide range of operations, including:
Meta introduced a series of advanced techniques to train its fourth-generation Llama family LLMs to improve accuracy and performance over prior iterations. Among the techniques used to train Llama 4 were:
Previous iterations of LlamaAfter the big debut of ChatGPT in November 2022, vendors of all sizes scrambled to find their footing in the LLM market. Meta was among the many that responded with in-house models, first Llama models, publicly announced in early 2023, which provided limited access. From Llama 2's mid-2023 release onward, all models have been available under open licenses.
The generative AI landscape is increasingly competitive, featuring major players such as OpenAI's GPT-4o, Google Gemini 2.0 and various open source projects including DeepSeek.
Here's how Llama 4 rates on three benchmarks: MMMU, or Massive Multi-discipline Multimodal Understanding, for image reasoning; LiveCodeBench, for coding; and GPQA Diamond, or Graduate-Level Google-Proof Q&A Diamond, for reasoning and knowledge. A higher score is better.
| Llama 4 Maverick | Gemini 2.0 Flash | GPT-4o | |
| MMMU image reasoning | 73.4 | 71.7 | 69.1 |
| LiveCodeBench | 43.4 | 34.05 | 32.3 |
| GPQA Diamond | 69.8 | 60.1 | 53.6 |
Meta Llama 4 Maverick and Scout are easily accessible from several different locations.
Sean Michael Kerner is an IT consultant, technology enthusiast and tinkerer. He has pulled Token Ring, configured NetWare and been known to compile his own Linux kernel. He consults with industry and media organizations on technology issues.
05 May 2025