Meta: Llama Guard 4 12B

Other
Input: image
Input: text
Output: text
Released: Apr 30, 2025
Updated: Apr 30, 2025

Llama Guard 4 is a multimodal pretrained model derived from Llama 4 Scout and fine-tuned for content safety classification. Like previous versions, it can classify content in both LLM inputs (prompt classification) and LLM responses (response classification). It behaves like an LLM: its output is generated text that states whether a given prompt or response is safe or unsafe and, if unsafe, lists the content categories violated.
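
Because the verdict arrives as generated text, a caller usually has to parse it before acting on it. The sketch below shows one way to do that in Python. The output layout it assumes (a first line reading "safe" or "unsafe", followed for unsafe content by a line of comma-separated category codes such as "S1,S10") is an assumption and is not specified on this page. `Verdict` and `parse_verdict` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Verdict:
    """Parsed classifier result (hypothetical helper type)."""
    safe: bool
    categories: list[str] = field(default_factory=list)


def parse_verdict(raw: str) -> Verdict:
    """Parse the classifier's text output into a Verdict.

    Assumed format (not confirmed by this page): the first non-empty line
    is "safe" or "unsafe"; for "unsafe", the next non-empty line holds
    comma-separated category codes.
    """
    lines = [ln.strip() for ln in raw.strip().splitlines() if ln.strip()]
    if not lines:
        raise ValueError("empty classifier output")

    label = lines[0].lower()
    if label == "safe":
        return Verdict(safe=True)
    if label == "unsafe":
        codes: list[str] = []
        if len(lines) > 1:
            codes = [c.strip() for c in lines[1].split(",") if c.strip()]
        return Verdict(safe=False, categories=codes)

    # Fail loudly rather than silently treating unknown text as safe.
    raise ValueError(f"unrecognized label: {lines[0]!r}")
```

Raising on unrecognized text, instead of defaulting to safe, keeps a malformed response from passing through moderation unnoticed.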

Llama Guard 4 was aligned to safeguard against the standardized MLCommons hazards taxonomy and designed to support multimodal Llama 4 capabilities. Specifically, it combines features from previous Llama Guard models, providing content moderation for English and multiple supported languages, along with enhanced capabilities to handle mixed text-and-image prompts, including multiple images. Additionally, Llama Guard 4 is integrated into the Llama Moderations API, extending robust safety classification to text and images.

163,840 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Vision Capabilities

Process and understand images alongside text inputs.

Available On

Provider   Model ID   Context  Max Output  Input Cost  Output Cost  Throughput  Latency
DeepInfra  deepInfra  164K     -           $0.05/M     $0.05/M      1042.8 t/s  986 ms
Together   together   1049K    -           $0.20/M     $0.20/M      -           -
Groq       groq       131K     1K          $0.20/M     $0.20/M      -           -

Standard Pricing
Input Tokens
$0.00000005

per token

Output Tokens
$0.00000005

per token
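
As a quick check on the figures above, $0.05 per million tokens works out to $0.00000005 per token. The Python sketch below estimates the cost of a workload from the per-million rates in the provider table. The rates are copied from that table; the function name `estimate_cost` and the token counts in the usage lines are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the provider table.
PRICE_PER_MILLION = {
    "DeepInfra": (Decimal("0.05"), Decimal("0.05")),
    "Together": (Decimal("0.20"), Decimal("0.20")),
    "Groq": (Decimal("0.20"), Decimal("0.20")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost for the given token counts."""
    in_rate, out_rate = PRICE_PER_MILLION[provider]
    return (in_rate * input_tokens + out_rate * output_tokens) / MILLION


# Per-token rate implied by the DeepInfra per-million price.
per_token = PRICE_PER_MILLION["DeepInfra"][0] / MILLION

# Hypothetical workload: 2,000,000 input tokens and 10,000 output tokens.
deepinfra_cost = estimate_cost("DeepInfra", 2_000_000, 10_000)
groq_cost = estimate_cost("Groq", 2_000_000, 10_000)
```

`Decimal` is used instead of `float` so that very small per-token amounts are not distorted by binary rounding.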