Qwen: Qwen VL Max

Qwen
Input: text
Input: image
Output: text
Released: Feb 1, 2025
Updated: Mar 28, 2025

Qwen VL Max is a visual understanding model with a 7,500-token context length. It is designed to perform well across a broad range of complex tasks.

7,500 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency
Alibaba | alibaba | 8K | 2K | $0.80/M | $3.20/M | 30.1 t/s | 2218 ms

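The throughput and latency figures above can be combined into a rough response-time estimate. The sketch below assumes that the listed latency is the time to the first token and that output is then generated at the listed throughput; the listing does not define either figure, and the function name is hypothetical.

```python
# Rough response-time estimate from the provider row above.
# Assumption: "Latency" is time to first token, and output tokens
# then stream at the listed throughput. The page does not confirm this.

THROUGHPUT_TPS = 30.1   # tokens per second, from the listing
LATENCY_MS = 2218       # milliseconds, from the listing


def estimate_response_seconds(output_tokens: int) -> float:
    """Return an approximate wall-clock time in seconds for a response."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return LATENCY_MS / 1000 + output_tokens / THROUGHPUT_TPS


if __name__ == "__main__":
    # A 500-token reply: about 2.2 s of latency plus about 16.6 s of generation.
    print(f"{estimate_response_seconds(500):.1f} s")
```

Measured times will vary with load and prompt size, so treat the result as an order-of-magnitude guide.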
Standard Pricing
Input Tokens
$0.0000008

per token

Output Tokens
$0.0000032

per token

Image Processing
$0.001024

per image
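
As a worked example of the pricing above, the following sketch estimates the cost of a single request. It assumes the listed dollar figures are per-token rates, which is the reading that matches the $0.80/M and $3.20/M figures in the provider table, and that each image is billed at the flat per-image rate. The function name and the sample request sizes are hypothetical.

```python
from decimal import Decimal

# Rates taken from the listing. Assumption: the token rates are per token
# ($0.0000008 x 1,000,000 = $0.80 per million, matching the provider table).
INPUT_PER_TOKEN = Decimal("0.0000008")
OUTPUT_PER_TOKEN = Decimal("0.0000032")
PER_IMAGE = Decimal("0.001024")


def estimate_cost(input_tokens: int, output_tokens: int, images: int = 0) -> Decimal:
    """Return the estimated request cost in US dollars."""
    return (
        INPUT_PER_TOKEN * input_tokens
        + OUTPUT_PER_TOKEN * output_tokens
        + PER_IMAGE * images
    )


if __name__ == "__main__":
    # Hypothetical request: 5,000 input tokens, 1,000 output tokens, 2 images.
    # 0.004 + 0.0032 + 0.002048 = 0.009248
    print(f"${estimate_cost(5000, 1000, 2):.6f}")
```

Decimal is used instead of float so that very small per-token rates add up without rounding drift.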