Llama Guard 3 8B
Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM – it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated.
Llama Guard 3 was aligned to safeguard against the MLCommons standardized hazards taxonomy and designed to support Llama 3.1 capabilities. Specifically, it provides content moderation in 8 languages, and was optimized to support safety and security for search and code interpreter tool calls.
131,072 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Nebius AI Studio | nebiusAiStudio | 131K | - | $0.02/M | $0.06/M | 24.5 t/s | 252 ms |
DeepInfra | deepInfra | 131K | - | $0.06/M | $0.06/M | - | - |
Together | together | 8K | - | $0.20/M | $0.20/M | 91.4 t/s | 529 ms |
Groq | groq | 8K | 8K | $0.20/M | $0.20/M | - | - |
SambaNova | sambaNova | 16K | 4K | $0.30/M | $0.30/M | - | - |
Cloudflare | cloudflare | - | - | $0.48/M | $0.03/M | - | - |
per 1K tokens
per 1K tokens