Meta introduces Llama 3.3, a 70B parameter model delivering performance comparable to Llama 3.1 405B but with significantly lower computational demands.
Meta AI has just introduced Llama 3.3, a 70-billion parameter model that delivers performance comparable to the much larger Llama 3.1 405B, but with far lower computational demands.
https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct
Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
Llama 3.3 model. Token counts refer to pretraining data only. All model versions use Grouped-Query Attention (GQA) for improved inference scalability.
Model Release Date:
Meta AI has just introduced Llama 3.3, a 70-billion parameter model that delivers performance comparable to the much larger Llama 3.1 405B, but with far lower computational demands.
https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct
| Training Data | Params | Input modalities | Output modalities | Context length | GQA | Token count | Knowledge cutoff |
|---|---|---|---|---|---|---|---|---|
Llama 3.3 (text only) | A new mix of publicly available online data. | 70B | Multilingual Text | Multilingual Text and code | 128k | Yes | 15T+ | December 2023 |
Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
Llama 3.3 model. Token counts refer to pretraining data only. All model versions use Grouped-Query Attention (GQA) for improved inference scalability.
Model Release Date:
- 70B Instruct: December 6, 2024