Back

Perplexity: Llama 3.1 Sonar 70B Online

Llama3
Input: text
Output: text
Released: Aug 1, 2024Updated: Mar 28, 2025

Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance.

This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online

127,072 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
Perplexityperplexity127K-$1.00/M$1.00/M80.5 t/s1544 ms
Standard Pricing
Input Tokens
$0.000001

per 1K tokens

Output Tokens
$0.000001

per 1K tokens

Request
$0.005

per request

Do Work. With AI.