Back

Meta: Llama 3.1 405B Instruct

Llama3
Input: text
Output: text
Released: Jul 23, 2024Updated: Mar 28, 2025

The highly anticipated 400B class of Llama3 is here! Clocking in at 128k context with impressive eval scores, the Meta AI team continues to push the frontier of open-source LLMs.

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 405B instruct-tuned version is optimized for high quality dialogue usecases.

It has demonstrated strong performance compared to leading closed-source models including GPT-4o and Claude 3.5 Sonnet in evaluations.

To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.

32,768 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
DeepInfradeepInfra33K16K$0.80/M$0.80/M22.3 t/s908 ms
Lambdalambda131K131K$0.80/M$0.80/M33.9 t/s556 ms
Nebius AI StudionebiusAiStudio131K-$1.00/M$3.00/M33.9 t/s412 ms
Fireworksfireworks131K-$3.00/M$3.00/M82.5 t/s644 ms
Togethertogether131K-$3.50/M$3.50/M46.9 t/s494 ms
Hyperbolichyperbolic131K-$4.00/M$4.00/M80.1 t/s1005 ms
SambaNovasambaNova16K4K$5.00/M$10.00/M103.5 t/s1531 ms
Standard Pricing
Input Tokens
$0.0000008

per 1K tokens

Output Tokens
$0.0000008

per 1K tokens

Do Work. With AI.