Meta: Llama 3.1 405B Instruct
The highly anticipated 400B class of Llama3 is here! Clocking in at 128k context with impressive eval scores, the Meta AI team continues to push the frontier of open-source LLMs.
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 405B instruct-tuned version is optimized for high quality dialogue usecases.
It has demonstrated strong performance compared to leading closed-source models including GPT-4o and Claude 3.5 Sonnet in evaluations.
To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
32,768 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
DeepInfra | deepInfra | 33K | 16K | $0.80/M | $0.80/M | 22.3 t/s | 908 ms |
Lambda | lambda | 131K | 131K | $0.80/M | $0.80/M | 33.9 t/s | 556 ms |
Nebius AI Studio | nebiusAiStudio | 131K | - | $1.00/M | $3.00/M | 33.9 t/s | 412 ms |
Fireworks | fireworks | 131K | - | $3.00/M | $3.00/M | 82.5 t/s | 644 ms |
Together | together | 131K | - | $3.50/M | $3.50/M | 46.9 t/s | 494 ms |
Hyperbolic | hyperbolic | 131K | - | $4.00/M | $4.00/M | 80.1 t/s | 1005 ms |
SambaNova | sambaNova | 16K | 4K | $5.00/M | $10.00/M | 103.5 t/s | 1531 ms |
per 1K tokens
per 1K tokens