Qwen: Qwen VL Max

Qwen

Input: text

Input: image

Output: text

Released: Feb 1, 2025 • Updated: Mar 28, 2025

Qwen VL Max is a visual understanding model with a 7,500-token context length. It accepts text and image input, produces text output, and is positioned for a broad range of complex visual understanding tasks.

7,500 Token Context

Process and analyze documents and conversations that fit within the context window (listed as 8K for the Alibaba provider).

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
Alibaba	alibaba	8K	2K	$0.80/M	$3.20/M	47.1 t/s	1670 ms
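
The throughput and latency figures in the table can be combined into a rough response-time estimate. The sketch below assumes that the listed latency is the delay before the first output token and that the listed throughput is a steady output rate in tokens per second; the page does not define either metric, so treat the result as a ballpark figure. The function name `estimate_response_seconds` is hypothetical.

```python
def estimate_response_seconds(
    output_tokens: int,
    latency_ms: float = 1670.0,    # "Latency" column of the provider table
    throughput_tps: float = 47.1,  # "Throughput" column of the provider table
) -> float:
    """Rough wall-clock estimate: initial latency plus generation time.

    Assumption (not stated on the page): latency is time to first token
    and throughput is a constant output rate in tokens per second.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return latency_ms / 1000.0 + output_tokens / throughput_tps


if __name__ == "__main__":
    # A response at the listed 2K maximum output length.
    print(f"{estimate_response_seconds(2000):.1f} s")
```

Under these assumptions, generation time dominates for long responses, while the fixed latency dominates for short ones.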

Standard Pricing

Input Tokens

$0.0000008

per token ($0.80 per 1M tokens)

Output Tokens

$0.0000032

per token ($3.20 per 1M tokens)

Image Processing

$0.001024

per image
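
As a worked example of the pricing arithmetic, the sketch below estimates the cost of a single request from the per-million token rates in the provider table ($0.80 input, $3.20 output) and the per-image fee. It assumes the image fee is a flat charge added on top of token costs; the page does not say whether images also consume input tokens. The function name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Rates taken from the listing: USD per 1M tokens and USD per image.
INPUT_USD_PER_MILLION = Decimal("0.80")
OUTPUT_USD_PER_MILLION = Decimal("3.20")
USD_PER_IMAGE = Decimal("0.001024")


def estimate_cost_usd(input_tokens: int, output_tokens: int, images: int = 0) -> Decimal:
    """Estimate the cost of one request in USD.

    Assumption: the per-image fee is a flat add-on to the token charges.
    """
    if min(input_tokens, output_tokens, images) < 0:
        raise ValueError("counts must be non-negative")
    million = Decimal(1_000_000)
    return (
        INPUT_USD_PER_MILLION * input_tokens / million
        + OUTPUT_USD_PER_MILLION * output_tokens / million
        + USD_PER_IMAGE * images
    )


if __name__ == "__main__":
    # 6,000 input tokens, 2,000 output tokens, one image.
    print(estimate_cost_usd(6000, 2000, images=1))
```

`Decimal` is used instead of floats so that very small per-token amounts do not accumulate rounding error when summed over many requests.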
