Back

Google: Gemini 1.5 Pro

Gemini
Input: text
Input: image
Output: text
Released: Apr 9, 2024Updated: Apr 12, 2025

Google's latest multimodal model, supports image and video[0] in text or chat prompts.

Optimized for language tasks including:

  • Code generation
  • Text generation
  • Text editing
  • Problem solving
  • Recommendations
  • Information extraction
  • Data extraction or generation
  • AI agents

Usage of Gemini is subject to Google's Gemini Terms of Use.

  • [0]: Video input is not available through OpenRouter at this time.

2,000,000 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

ProviderModel IDContextMax OutputInput CostOutput CostThroughputLatency
Google Vertexvertex2000K8K$1.25/M$5.00/M61.7 t/s1244 ms
Google AI Studiogoogle2000K8K$1.25/M$5.00/M62.7 t/s17648 ms
Standard Pricing
Input Tokens
$0.00000125

per 1K tokens

Output Tokens
$0.000005

per 1K tokens

Image Processing
$0.0006575

per image

Variable Pricing Tiers

prompt threshold

Threshold: 128000

Prompt: $0.0000025 / Completion: $0.00001 (per 1K tokens)

Do Work. With AI.