Back

Qwen: Qwen3 30B A3B

Qwen3
Input: text
Output: text
Released: Apr 28, 2025Updated: May 12, 2025

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique ability to switch seamlessly between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance.

Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsense reasoning, creative writing, and interactive dialogue capabilities. The Qwen3-30B-A3B variant includes 30.5 billion parameters (3.3 billion activated), 48 layers, 128 experts (8 activated per task), and supports up to 131K token contexts with YaRN, setting a new standard among open-source models.

40,960 Token Context

Process and analyze large documents and conversations.

Hybrid Reasoning

Choose between rapid responses and extended, step-by-step processing for complex tasks.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
DeepInfradeepInfra41K41K$0.08/M$0.29/M107.1 t/s624 ms
InferenceNetinferenceNet16K16K$0.08/M$0.29/M16.0 t/s999 ms
Parasailparasail41K41K$0.09/M$0.50/M155.0 t/s386 ms
NebiusnebiusAiStudio41K-$0.10/M$0.30/M111.8 t/s513 ms
NovitanovitaAi41K20K$0.10/M$0.45/M158.9 t/s685 ms
Fireworksfireworks40K-$0.15/M$0.60/M124.0 t/s900 ms
Standard Pricing
Input Tokens
$0.00000008

per 1K tokens

Output Tokens
$0.00000029

per 1K tokens

Do Work. With AI.