AI21: Jamba 1.6 Large
AI21 Jamba Large 1.6 is a high-performance hybrid foundation model that combines State Space Model (Mamba) layers with Transformer attention. Developed by AI21, it handles extremely long contexts (256K tokens), delivers fast inference (up to 2.5x faster than comparable models), and supports structured JSON output and tool use. The model has 94 billion active parameters (398 billion total), optimized quantization support (ExpertsInt8), and multilingual proficiency in languages such as English, Spanish, French, Portuguese, Italian, Dutch, German, Arabic, and Hebrew.
Usage of this model is subject to the Jamba Open Model License.
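The chat, JSON-output, and tool-use capabilities are served through AI21's chat completions API. The snippet below is a minimal sketch using the AI21 Python SDK; the package (`ai21`), the client surface, and the model identifier `jamba-large-1.6` are assumptions here and should be verified against AI21's current documentation.

```python
# Minimal sketch: a chat completion against Jamba Large 1.6 via the AI21
# Python SDK. The model identifier "jamba-large-1.6" is an assumption;
# confirm the exact name in AI21's docs.
import os

from ai21 import AI21Client
from ai21.models.chat import ChatMessage

client = AI21Client(api_key=os.environ["AI21_API_KEY"])

response = client.chat.completions.create(
    model="jamba-large-1.6",  # assumed model identifier
    messages=[
        ChatMessage(role="system", content="You are a concise technical assistant."),
        ChatMessage(role="user", content="List three uses of a 256K-token context window."),
    ],
    max_tokens=300,
)

print(response.choices[0].message.content)
```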
- 256,000 Token Context: Process and analyze large documents and conversations (see the long-document sketch after this list).
- Advanced Coding: Improved capabilities in front-end development and full-stack updates.
- Agentic Workflows: Autonomously navigate multi-step processes with improved reliability.
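As a concrete illustration of the long-context and structured-output features above, the sketch below sends a large document and asks for a JSON answer via AI21's REST chat completions endpoint. The endpoint URL, payload fields, and the `response_format` option are assumptions and should be checked against the official API reference.

```python
# Sketch: combine the 256K context window with JSON output mode by sending a
# long document plus an extraction prompt. Endpoint URL, payload shape, and
# "response_format" are assumptions based on AI21's chat completions API.
import json
import os

import requests

API_URL = "https://api.ai21.com/studio/v1/chat/completions"  # assumed endpoint

with open("contract.txt", "r", encoding="utf-8") as f:
    document = f.read()  # may be very large; the model accepts up to 256K tokens

payload = {
    "model": "jamba-large-1.6",  # assumed model identifier
    "messages": [
        {
            "role": "user",
            "content": "Extract the parties, effective date, and termination "
                       f"clause as JSON from this document:\n\n{document}",
        },
    ],
    "response_format": {"type": "json_object"},  # assumed JSON-mode flag
    "max_tokens": 1024,
}

resp = requests.post(
    API_URL,
    headers={"Authorization": f"Bearer {os.environ['AI21_API_KEY']}"},
    json=payload,
    timeout=120,
)
resp.raise_for_status()
print(json.loads(resp.json()["choices"][0]["message"]["content"]))
```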
Available On
| Provider | Model ID | Context | Max Output | Input Cost (per 1M tokens) | Output Cost (per 1M tokens) | Throughput | Latency |
|---|---|---|---|---|---|---|---|
| AI21 | ai21 | 256K | 4K | $2.00 | $8.00 | 62.5 t/s | 936 ms |
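The listed rates make per-request cost straightforward to estimate. A small sketch, assuming the $2.00 / $8.00 per-million-token prices shown above:

```python
# Estimate the USD cost of a single request at the listed AI21 rates
# ($2.00 per million input tokens, $8.00 per million output tokens).
INPUT_COST_PER_M = 2.00
OUTPUT_COST_PER_M = 8.00

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the cost of one request at the listed per-million-token rates."""
    return (input_tokens / 1_000_000) * INPUT_COST_PER_M + \
           (output_tokens / 1_000_000) * OUTPUT_COST_PER_M

# Example: a 200K-token document with a 2K-token summary costs about $0.416.
print(f"${request_cost(200_000, 2_000):.3f}")
```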