Meta: Llama 3.2 1B Instruct
Llama 3.2 1B is a 1-billion-parameter language model built for natural language tasks such as summarization, dialogue, and multilingual text analysis. Its small size lets it run in low-resource environments while maintaining strong task performance.
Supporting eight core languages and fine-tunable for more, Llama 3.2 1B suits businesses and developers seeking a lightweight yet capable model that can operate in diverse multilingual settings without the computational demands of larger models.
See the original model card for further details.
Usage of this model is subject to Meta's Acceptable Use Policy.
131,072 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Nebius AI Studio | nebiusAiStudio | 131K | - | $0.01/M | $0.01/M | 31.0 t/s | 661 ms |
DeepInfra | deepInfra | 131K | 16K | $0.01/M | $0.01/M | 142.4 t/s | 832 ms |
inference.net | inferenceNet | 16K | 16K | $0.01/M | $0.01/M | 140.6 t/s | 1211 ms |
Cloudflare | cloudflare | 60K | - | $0.03/M | $0.20/M | 272.7 t/s | 703 ms |
SambaNova | sambaNova | 16K | 4K | $0.04/M | $0.08/M | 2586.7 t/s | 1119 ms |
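
The per-million-token prices in the table can be turned into a per-request estimate with simple arithmetic: multiply each token count by its rate and divide by one million. The sketch below is illustrative only. The `Pricing` class, the `PRICING` dictionary, and the `estimate_cost` function are hypothetical names introduced here, not part of any provider's API, and the prices are copied from the table as listed, so they may be out of date.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Hypothetical container for one provider's listed rates."""
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens


# Rates copied from the table above, keyed by the listed Model ID.
PRICING = {
    "nebiusAiStudio": Pricing(0.01, 0.01),
    "deepInfra": Pricing(0.01, 0.01),
    "inferenceNet": Pricing(0.01, 0.01),
    "cloudflare": Pricing(0.03, 0.20),
    "sambaNova": Pricing(0.04, 0.08),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    rates = PRICING[provider]
    total = (input_tokens * rates.input_per_million
             + output_tokens * rates.output_per_million)
    return total / 1_000_000


if __name__ == "__main__":
    # Compare providers for a request with 10,000 input and 2,000 output tokens.
    for name in PRICING:
        print(f"{name}: ${estimate_cost(name, 10_000, 2_000):.6f}")
```

Because input and output rates differ on some providers, an output-heavy workload can rank the providers differently than an input-heavy one, so it is worth estimating with token counts close to the real workload.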