Perplexity: Llama 3.1 Sonar 8B Online

Llama3
Input: text
Output: text
Released: Aug 1, 2024
Updated: Mar 28, 2025

Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance.

This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online

127,072 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

Provider    Model ID    Context  Max Output  Input Cost  Output Cost  Throughput  Latency
Perplexity  perplexity  127K     -           $0.20/M     $0.20/M      204.2 t/s   1289 ms

Standard Pricing

Input Tokens: $0.20 per 1M tokens
Output Tokens: $0.20 per 1M tokens
Request: $0.005 per request
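
The rates above can be combined into a rough per-call cost estimate. The sketch below is illustrative only: the function name `estimate_cost` and its parameters are hypothetical, and it assumes the per-request fee is charged once per API call on top of the token charges, which the listing does not state explicitly.

```python
# Rates taken from the Standard Pricing listing above.
INPUT_USD_PER_M = 0.20    # USD per 1M input tokens
OUTPUT_USD_PER_M = 0.20   # USD per 1M output tokens
REQUEST_USD = 0.005       # USD per request


def estimate_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> float:
    """Estimate total cost in USD.

    Assumption (not confirmed by the listing): the request fee is
    added once per request, in addition to the token charges.
    """
    token_cost = (
        input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    ) / 1_000_000
    return token_cost + requests * REQUEST_USD


if __name__ == "__main__":
    # A single call with a 2,000-token prompt and a 500-token reply.
    print(f"${estimate_cost(2_000, 500):.4f}")
```

Under that assumption, a call with 2,000 input tokens and 500 output tokens costs about $0.0055, of which $0.005 is the request fee, so for short exchanges the per-request charge outweighs the token charges.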