Back

Qwen: Qwen3 8B

Qwen3
Input: text
Output: text
Released: Apr 28, 2025Updated: May 12, 2025

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.

128,000 Token Context

Process and analyze large documents and conversations.

Hybrid Reasoning

Choose between rapid responses and extended, step-by-step processing for complex tasks.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
NovitaAInovitaAi128K-$0.04/M$0.14/M41.8 t/s1402 ms
Standard Pricing
Input Tokens
$0.000000035

per 1K tokens

Output Tokens
$0.000000138

per 1K tokens

Do Work. With AI.