Back
Perplexity: Llama 3.1 Sonar 70B Online
Llama3
Input: text
Output: text
Released: Aug 1, 2024•Updated: Mar 28, 2025
Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance.
This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online
127,072 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Perplexity | perplexity | 127K | - | $1.00/M | $1.00/M | 80.5 t/s | 1544 ms |
Standard Pricing
Input Tokens
$0.000001
per 1K tokens
Output Tokens
$0.000001
per 1K tokens
Request
$0.005
per request