Meta: Llama 3.1 8B Instruct
Llama3
Input: text
Output: text
Released: Jul 23, 2024 • Updated: Mar 28, 2025
Llama 3.1, the Meta model class released in July 2024, launched in a variety of sizes and flavors. This 8B instruction-tuned version is fast and efficient.
It has demonstrated strong performance compared to leading closed-source models in human evaluations.
To read more about the model release, see Meta's release announcement. Usage of this model is subject to Meta's Acceptable Use Policy.
16,384 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
inference.net | inferenceNet | 16K | 16K | $0.02/M | $0.03/M | 70.8 t/s | 1300 ms |
DeepInfra (Turbo) | deepInfra (turbo) | 131K | 16K | $0.02/M | $0.03/M | 93.1 t/s | 476 ms |
NovitaAI | novitaAi | 16K | 8K | $0.02/M | $0.05/M | 79.6 t/s | 714 ms |
Nebius AI Studio | nebiusAiStudio | 131K | - | $0.02/M | $0.06/M | 48.4 t/s | 538 ms |
Lambda | lambda | 131K | 131K | $0.02/M | $0.04/M | 155.0 t/s | 345 ms |
DeepInfra | deepInfra | 131K | 16K | $0.03/M | $0.05/M | 58.8 t/s | 235 ms |
Cloudflare | cloudflare | 32K | - | $0.04/M | $0.38/M | 23.8 t/s | 718 ms |
Groq | groq | 131K | 131K | $0.05/M | $0.08/M | 1477.2 t/s | 903 ms |
NextBit | nextBit | 131K | - | $0.06/M | $0.10/M | 92.9 t/s | 1358 ms |
Hyperbolic | hyperbolic | 131K | - | $0.10/M | $0.10/M | 235.6 t/s | 1090 ms |
Cerebras | cerebras | 32K | 32K | $0.10/M | $0.10/M | 3074.9 t/s | 319 ms |
Friendli | friendli | 131K | 8K | $0.10/M | $0.10/M | 281.0 t/s | 292 ms |
SambaNova | sambaNova | 16K | 4K | $0.10/M | $0.20/M | 847.1 t/s | 194 ms |
kluster.ai | klusterAi | 131K | 131K | $0.18/M | $0.18/M | 63.9 t/s | 598 ms |
Together | together | 131K | - | $0.18/M | $0.18/M | 203.0 t/s | 359 ms |
Fireworks | fireworks | 131K | - | $0.20/M | $0.20/M | 265.7 t/s | 390 ms |
Avian.io | avianIo | 131K | - | $0.20/M | $0.20/M | - | - |
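The throughput and latency columns trade off differently depending on response length. A minimal sketch of that comparison follows, using four rows copied from the table above. It assumes that "Latency" means time to first token and that "Throughput" is a steady output rate, neither of which the table states explicitly. The `Provider` class and `estimated_seconds` function are illustrative names, not part of any provider's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    throughput_tps: float  # output tokens per second, from the table
    latency_ms: float      # assumed to be time to first token


# A few rows copied from the table; the figures are a snapshot and will drift.
PROVIDERS = [
    Provider("Lambda", 155.0, 345),
    Provider("Groq", 1477.2, 903),
    Provider("Cerebras", 3074.9, 319),
    Provider("SambaNova", 847.1, 194),
]


def estimated_seconds(provider: Provider, output_tokens: int) -> float:
    """Rough wall-clock estimate: initial latency plus generation time."""
    return provider.latency_ms / 1000 + output_tokens / provider.throughput_tps


def fastest(output_tokens: int) -> Provider:
    """Provider with the lowest estimated time for a given response length."""
    return min(PROVIDERS, key=lambda p: estimated_seconds(p, output_tokens))


if __name__ == "__main__":
    for n in (10, 500):
        best = fastest(n)
        print(f"{n} output tokens: {best.name} "
              f"(~{estimated_seconds(best, n):.2f} s)")
```

Under these assumptions, latency dominates for very short responses and throughput dominates for long ones, so the best provider can change with the expected output length.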
Standard Pricing
Input Tokens
$0.00000002
per token ($0.02 per 1M tokens)
Output Tokens
$0.00000003
per token ($0.03 per 1M tokens)
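The provider table quotes prices per million tokens, so converting units is a common source of slips. The sketch below takes the $0.02/M input and $0.03/M output rates from the table and derives the per-token price, the per-1K price, and the cost of one request. It uses `Decimal` to avoid floating-point noise at these magnitudes. The function names and the 2,000-input, 500-output request are illustrative assumptions, not figures from the page.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)

# Rates taken from the first row of the provider table (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("0.02")
OUTPUT_USD_PER_M = Decimal("0.03")


def per_token(usd_per_million: Decimal) -> Decimal:
    """Convert a per-million-token price to a per-token price."""
    return usd_per_million / TOKENS_PER_MILLION


def request_cost(input_tokens: int, output_tokens: int,
                 input_usd_per_m: Decimal, output_usd_per_m: Decimal) -> Decimal:
    """Cost in USD of one request, billing input and output separately."""
    return (input_tokens * per_token(input_usd_per_m)
            + output_tokens * per_token(output_usd_per_m))


if __name__ == "__main__":
    print("per token:", format(per_token(INPUT_USD_PER_M), "f"))
    print("per 1K tokens:", format(per_token(INPUT_USD_PER_M) * 1000, "f"))
    # Hypothetical request: 2,000 input tokens and 500 output tokens.
    cost = request_cost(2000, 500, INPUT_USD_PER_M, OUTPUT_USD_PER_M)
    print("example request:", format(cost, "f"))
```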