Google: Gemini 2.0 Flash
Gemini 2.0 Flash offers a significantly faster time to first token (TTFT) than Gemini 1.5 Flash while maintaining quality on par with larger models such as Gemini 1.5 Pro. It improves multimodal understanding, coding, complex instruction following, and function calling. Together, these advances deliver more seamless and robust agentic experiences.
1,048,576 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Google AI Studio | | 1049K | 8K | $0.10/M | $0.40/M | 152.5 t/s | 553 ms |
Vertex | | 1000K | 8K | $0.15/M | $0.60/M | 156.8 t/s | 469 ms |
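The throughput and latency columns can be combined into a rough response-time estimate. The sketch below is illustrative only: it assumes the latency column is the delay before the first output token and that throughput is a steady output rate in tokens per second, neither of which the table states explicitly. The `Provider` class and `estimate_seconds` function are hypothetical names, not part of any provider SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    throughput_tps: float  # output tokens per second, from the table
    latency_ms: float      # assumed to be time to first token


# Figures copied from the "Available On" table.
PROVIDERS = [
    Provider("Google AI Studio", throughput_tps=152.5, latency_ms=553.0),
    Provider("Vertex", throughput_tps=156.8, latency_ms=469.0),
]


def estimate_seconds(provider: Provider, output_tokens: int) -> float:
    """Estimate wall-clock time as latency plus generation time (linear model)."""
    return provider.latency_ms / 1000.0 + output_tokens / provider.throughput_tps


if __name__ == "__main__":
    for p in PROVIDERS:
        print(f"{p.name}: {estimate_seconds(p, 1000):.2f} s for 1000 output tokens")
```

Measured figures like these vary with load and prompt size, so the result is best read as an order-of-magnitude guide rather than a guarantee.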
Standard Pricing
Item | Unit | Price |
---|---|---|
Input Tokens | per 1M tokens | $0.10 |
Output Tokens | per 1M tokens | $0.40 |
Image Processing | per image | $0.0000258 |
Input Cache Read | per 1M tokens | $0.02 |
Input Cache Write | per 1M tokens | $0.18 |
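As a worked example of the standard pricing, the following sketch adds up the cost of a single request. It assumes the five line items are billed independently and that the caller supplies disjoint counts (for instance, tokens read from cache are not also counted as regular input tokens). Actual billing rules may differ by provider. `request_cost_usd` and the `PRICES_USD` keys are hypothetical names chosen for illustration.

```python
# Standard prices listed above, in US dollars.
PRICES_USD = {
    "input_per_m": 0.10,        # per 1M input tokens
    "output_per_m": 0.40,       # per 1M output tokens
    "image_each": 0.0000258,    # per image
    "cache_read_per_m": 0.02,   # per 1M cached tokens read
    "cache_write_per_m": 0.18,  # per 1M tokens written to cache
}


def request_cost_usd(
    input_tokens: int = 0,
    output_tokens: int = 0,
    images: int = 0,
    cache_read_tokens: int = 0,
    cache_write_tokens: int = 0,
) -> float:
    """Sum each line item; assumes the counts do not overlap."""
    per_token = (
        input_tokens * PRICES_USD["input_per_m"]
        + output_tokens * PRICES_USD["output_per_m"]
        + cache_read_tokens * PRICES_USD["cache_read_per_m"]
        + cache_write_tokens * PRICES_USD["cache_write_per_m"]
    ) / 1_000_000
    return per_token + images * PRICES_USD["image_each"]


if __name__ == "__main__":
    # Example: a 200K-token prompt, a 2K-token answer, and 10 images.
    cost = request_cost_usd(input_tokens=200_000, output_tokens=2_000, images=10)
    print(f"${cost:.6f}")
```

Under these assumptions, reading a token from cache costs one fifth of sending it as regular input ($0.02 versus $0.10 per 1M tokens), which is where caching a long shared prompt can pay off across repeated requests.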