Google: Gemini 1.5 Pro

Gemini

Input: text

Input: image

Output: text

Released: Apr 9, 2024•Updated: Apr 12, 2025

Google's latest multimodal model, supports image and video[0] in text or chat prompts.

Optimized for language tasks including:

Usage of Gemini is subject to Google's Gemini Terms of Use.

Process and analyze large documents and conversations.

Improved capabilities in front-end development and full-stack updates.

Autonomously navigate multi-step processes with improved reliability.

Process and understand images alongside text inputs.

Available On

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
Google Vertex	vertex	2000K	8K	$1.25/M	$5.00/M	61.7 t/s	1244 ms
Google AI Studio	google	2000K	8K	$1.25/M	$5.00/M	62.7 t/s	17648 ms

Standard Pricing

Input Tokens

$0.00000125

per 1K tokens

Output Tokens

$0.000005

per 1K tokens

Image Processing

$0.0006575

per image

Variable Pricing Tiers

prompt threshold

Threshold: 128000

Prompt: $0.0000025 / Completion: $0.00001 (per 1K tokens)