Back
Meta: Llama 3 70B Instruct
Llama3
Input: text
Output: text
Released: Apr 18, 2024•Updated: Mar 28, 2025
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases.
It has demonstrated strong performance compared to leading closed-source models in human evaluations.
To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
8,192 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
DeepInfra | deepInfra | 8K | 16K | $0.30/M | $0.40/M | 34.0 t/s | 687 ms |
Hyperbolic | hyperbolic | 8K | - | $0.40/M | $0.40/M | 21.8 t/s | 1460 ms |
NovitaAI | novitaAi | 8K | 8K | $0.51/M | $0.74/M | 20.6 t/s | 1089 ms |
Groq | groq | 8K | 8K | $0.59/M | $0.79/M | 371.4 t/s | 184 ms |
Together | together | 8K | - | $0.88/M | $0.88/M | 95.9 t/s | 673 ms |
Standard Pricing
Input Tokens
$0.0000003
per 1K tokens
Output Tokens
$0.0000004
per 1K tokens