Do Services-as-Software

AI21 Jamba Mini 1.6 is a hybrid foundation model combining State Space Models (Mamba) with Transformer attention mechanisms. With 12 billion active parameters (52 billion total), this model excels in extremely long-context tasks (up to 256K tokens) and achieves superior inference efficiency, outperforming comparable open models on tasks such as retrieval-augmented generation (RAG) and grounded question answering. Jamba Mini 1.6 supports multilingual tasks across English, Spanish, French, Portuguese, Italian, Dutch, German, Arabic, and Hebrew, along with structured JSON output and tool-use capabilities.

Usage of this model is subject to the Jamba Open Model License.

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
AI21	ai21	256K	4K	$0.20/M	$0.40/M	207.9 t/s	460 ms

Provider

Model ID

Context

Max Output

Input Cost

Output Cost

Throughput

Latency

AI21

ai21

256K

$0.20/M

$0.40/M

207.9 t/s

460 ms

AI21: Jamba Mini 1.6

256,000 Token Context

Advanced Coding

Agentic Workflows

Available On

Do Work. With AI.