DeepSeek: DeepSeek V3 0324 (free)
DeepSeek
Input: text
Output: text
Released: Mar 24, 2025 • Updated: Mar 28, 2025
DeepSeek V3, a 685B-parameter mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team.
It succeeds the original DeepSeek V3 model and performs well across a variety of tasks.
163,840 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
DeepInfra | deepInfra | 164K | - | $0.30/M | $0.88/M | 29.9 t/s | 874 ms |
NovitaAI | novitaAi | 128K | 16K | $0.33/M | $1.30/M | 20.4 t/s | 1277 ms |
kluster.ai | klusterAi | 164K | 164K | $0.33/M | $1.40/M | 19.5 t/s | 1246 ms |
Lambda | lambda | 164K | 164K | $0.34/M | $0.88/M | 27.9 t/s | 724 ms |
inference.net | inferenceNet | 128K | 33K | $0.45/M | $1.45/M | 21.0 t/s | 1535 ms |
Nebius AI Studio | nebiusAiStudio | 164K | - | $0.50/M | $1.50/M | 21.7 t/s | 841 ms |
Parasail | parasail | 164K | 164K | $0.74/M | $1.50/M | 52.0 t/s | 829 ms |
CentML | centMl | 164K | 164K | $0.80/M | $0.80/M | 9.5 t/s | 943 ms |
Fireworks | fireworks | 164K | - | $0.90/M | $0.90/M | 68.7 t/s | 647 ms |
GMICloud | gmiCloud | 131K | - | $0.90/M | $0.90/M | 61.6 t/s | 861 ms |
Hyperbolic | hyperbolic | 164K | - | $1.25/M | $1.25/M | 21.9 t/s | 1801 ms |
Together | together | 131K | 12K | $1.25/M | $1.25/M | 31.4 t/s | 2164 ms |
SambaNova | sambaNova | 33K | 16K | $3.00/M | $4.50/M | 182.1 t/s | 2206 ms |
DeepSeek | deepSeek | 64K | 8K | $0.27/M | $1.10/M | 20.1 t/s | 4513 ms |
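
The per-million-token prices in the table can be turned into a rough per-request cost estimate. The sketch below is illustrative only: the `PRICES` dictionary copies a few rows from the table, while the `estimate_cost` helper and the token counts are hypothetical and not part of any provider's API. It assumes billing is linear in token count, which may not hold for every provider.

```python
# Per-million-token prices in USD as (input, output), copied from the table above.
# Only a few providers are included; add more rows as needed.
PRICES = {
    "deepInfra": (0.30, 0.88),
    "lambda": (0.34, 0.88),
    "fireworks": (0.90, 0.90),
    "deepSeek": (0.27, 1.10),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated request cost in USD.

    Hypothetical helper: assumes cost scales linearly with token count.
    """
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 20,000-token completion.
    for provider in PRICES:
        cost = estimate_cost(provider, 100_000, 20_000)
        print(f"{provider}: ${cost:.4f}")
```

Comparing the estimates alongside the throughput and latency columns gives a fuller picture than price alone, since the cheapest provider is not necessarily the fastest.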
Standard Pricing