Meta: Llama 2 70B Chat

Llama2
Input: text
Output: text
Released: Jun 20, 2023
Updated: Mar 28, 2025

The flagship 70-billion-parameter language model from Meta, fine-tuned for chat completions. Llama 2 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) to align with human preferences for helpfulness and safety.

4,096 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency
Together | together | 4K | - | $0.90/M | $0.90/M | 44.4 t/s | 656 ms

Standard Pricing

Input Tokens
$0.0009 per 1K tokens ($0.90 per 1M tokens)

Output Tokens
$0.0009 per 1K tokens ($0.90 per 1M tokens)
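
As a rough illustration of how the listed rate translates into per-request cost, the sketch below converts the $0.90 per 1M token price from the provider table into a dollar estimate. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen for this example only; actual billing may round or meter differently.

```python
from decimal import Decimal

# Rates taken from the provider table: $0.90 per 1M tokens, input and output.
# The constant and function names here are illustrative, not part of any API.
INPUT_USD_PER_MILLION = Decimal("0.90")
OUTPUT_USD_PER_MILLION = Decimal("0.90")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in US dollars at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    return (input_cost + output_cost) / TOKENS_PER_MILLION


if __name__ == "__main__":
    # A request with 3,000 prompt tokens and 1,000 generated tokens.
    print(f"${estimate_cost_usd(3000, 1000)}")
```

`Decimal` is used instead of `float` so that very small per-token amounts are not distorted by binary rounding. At these rates, a request with 3,000 input tokens and 1,000 output tokens comes to $0.0036.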