Llama 3.3 70B model is out - Meta

MihiCherub

Well-known member
  • Sep 14, 2009
    18,868
    1
    9,642
    113
    Gampaha
    Meta introduces Llama 3.3, a 70B parameter model delivering performance comparable to Llama 3.1 405B but with significantly lower computational demands.

    ad_4nxceo65lqgskpt2gyjdxnyxikzmv_t-3njxvxaqiklpa-hbtvmpmipofzejtrxzdfs2byf_abs10_hfrknvkxzlzxlr0fvtgipmzktuwxhok7awlmun10ox6ahwyou_2r_gyatf4.png


    Meta AI has just introduced Llama 3.3, a 70-billion parameter model that delivers performance comparable to the much larger Llama 3.1 405B, but with far lower computational demands.

    https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct

    Training Data​
    Params​
    Input modalities​
    Output modalities​
    Context length​
    GQA​
    Token count​
    Knowledge cutoff​
    Llama 3.3 (text only)​
    A new mix of publicly available online data.​
    70B​
    Multilingual Text​
    Multilingual Text and code​
    128k​
    Yes​
    15T+​
    December 2023​

    Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

    Llama 3.3 model. Token counts refer to pretraining data only. All model versions use Grouped-Query Attention (GQA) for improved inference scalability.

    Model Release Date:
    • 70B Instruct: December 6, 2024
    ad_4nxcgxjt1c5ju_yrd5tu6o7voalwjren2u-us_snst8csi-iljoa577shvw9egrhkbokjxsxnwhoe_jp-ug4sjpg2n0v7ot7osoejzjq3hiqs9l4yamazydozbsokt1_kt8s2bdiy1w.png
     

    RandomGuy

    Well-known member
  • Oct 15, 2014
    17,373
    16,216
    113
    thanks. mehema thread dan elakiriye adui. okkoma deshapalanaayata kade yana ewa thiyenne.