Back
Qwen: Qwen VL Max
Qwen
Input: text
Input: image
Output: text
Released: Feb 1, 2025•Updated: Mar 28, 2025
Qwen VL Max is a visual understanding model with 7500 tokens context length. It excels in delivering optimal performance for a broader spectrum of complex tasks.
7,500 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Alibaba | alibaba | 8K | 2K | $0.80/M | $3.20/M | 30.1 t/s | 2218 ms |
Standard Pricing
Input Tokens
$0.0000008
per 1K tokens
Output Tokens
$0.0000032
per 1K tokens
Image Processing
$0.001024
per image