Do Services-as-Software

Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and video. It's adept at processing visual and text inputs such as photographs, documents, infographics, and screenshots.

Gemini 1.5 Flash is designed for high-volume, high-frequency tasks where cost and latency matter. On most common tasks, Flash achieves comparable quality to other Gemini Pro models at a significantly reduced cost. Flash is well-suited for applications like chat assistants and on-demand content generation where speed and scale matter.

Usage of Gemini is subject to Google's Gemini Terms of Use.

#multimodal

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
Google Vertex	vertex	1000K	8K	$0.07/M	$0.30/M	152.9 t/s	343 ms
Google AI Studio	google	1000K	8K	$0.07/M	$0.30/M	154.4 t/s	443 ms

Provider

Model ID

Context

Max Output

Input Cost

Output Cost

Throughput

Latency

Google Vertex

vertex

1000K

$0.07/M

$0.30/M

152.9 t/s

343 ms

Google AI Studio

google

1000K

$0.07/M

$0.30/M

154.4 t/s

443 ms

Google: Gemini 1.5 Flash

1,000,000 Token Context

Advanced Coding

Agentic Workflows

Vision Capabilities

Available On

Do Work. With AI.