Meta: Llama 3.1 405B Instruct

Llama3

Input: text

Output: text

Released: Jul 23, 2024•Updated: Mar 28, 2025

The highly anticipated 400B class of Llama3 is here! Clocking in at 128k context with impressive eval scores, the Meta AI team continues to push the frontier of open-source LLMs.

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 405B instruct-tuned version is optimized for high quality dialogue usecases.

It has demonstrated strong performance compared to leading closed-source models including GPT-4o and Claude 3.5 Sonnet in evaluations.

To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.

32,768 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
DeepInfra	deepInfra	33K	16K	$0.80/M	$0.80/M	24.3 t/s	959 ms
Lambda	lambda	131K	131K	$0.80/M	$0.80/M	33.5 t/s	452 ms
Nebius	nebiusAiStudio	131K	-	$1.00/M	$3.00/M	32.3 t/s	674 ms
Fireworks	fireworks	131K	-	$3.00/M	$3.00/M	53.7 t/s	803 ms
Together	together	131K	-	$3.50/M	$3.50/M	51.0 t/s	1098 ms
Hyperbolic	hyperbolic	131K	-	$4.00/M	$4.00/M	59.9 t/s	1404 ms
SambaNova	sambaNova	16K	4K	$5.00/M	$10.00/M	111.9 t/s	2973 ms

Standard Pricing

Input Tokens

per 1M tokens

$0.80

Output Tokens

per 1M tokens

$0.80

Do Work. With AI.

Join Waitlist Learn more