Back
xAI: Grok Vision Beta
Grok
Input: text
Input: image
Output: text
Released: Nov 19, 2024•Updated: Mar 28, 2025
Grok Vision Beta is xAI's experimental language model with vision capability.
8,192 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
xAI | xAi | 8K | - | $5.00/M | $15.00/M | 62.4 t/s | 468 ms |
Standard Pricing
Input Tokens
$0.000005
per 1K tokens
Output Tokens
$0.000015
per 1K tokens
Image Processing
$0.009
per image