Back
Google: Gemini 1.5 Pro
Gemini
Input: text
Input: image
Output: text
Released: Apr 9, 2024•Updated: Apr 12, 2025
Google's latest multimodal model, supports image and video[0] in text or chat prompts.
Optimized for language tasks including:
- Code generation
- Text generation
- Text editing
- Problem solving
- Recommendations
- Information extraction
- Data extraction or generation
- AI agents
Usage of Gemini is subject to Google's Gemini Terms of Use.
- [0]: Video input is not available through OpenRouter at this time.
2,000,000 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Vision Capabilities
Process and understand images alongside text inputs.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Google Vertex | vertex | 2000K | 8K | $1.25/M | $5.00/M | 61.7 t/s | 1244 ms |
Google AI Studio | 2000K | 8K | $1.25/M | $5.00/M | 62.7 t/s | 17648 ms |
Standard Pricing
Input Tokens
$0.00000125
per 1K tokens
Output Tokens
$0.000005
per 1K tokens
Image Processing
$0.0006575
per image
Variable Pricing Tiers
prompt threshold
Threshold: 128000
Prompt: $0.0000025 / Completion: $0.00001 (per 1K tokens)