
NeverSleep: Llama 3 Lumimaid 8B (extended)

Llama3
Input: text
Output: text
Released: May 4, 2024
Updated: Mar 28, 2025

The NeverSleep team is back, with a Llama 3 8B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necessary.

To enhance its overall intelligence and chat capability, roughly 40% of the training data was not roleplay. This gives the model a broader base of knowledge to draw on, while still keeping roleplay as its primary strength.

Usage of this model is subject to Meta's Acceptable Use Policy.

24,576 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

Provider          Model ID          Context  Max Output  Input Cost  Output Cost  Throughput  Latency
Mancer (private)  mancer (private)  25K      2K          $0.20/M     $1.25/M      64.5 t/s    630 ms
Featherless       featherless       8K       4K          $0.80/M     $1.20/M      25.7 t/s    1159 ms

Standard Pricing

Input Tokens
$0.0000002

per token ($0.20 per million tokens)

Output Tokens
$0.00000125

per token ($1.25 per million tokens)
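
The per-million rates in the provider table can be turned into a rough per-request estimate. The sketch below is illustrative only: the dictionary keys and the `estimate_cost` helper are hypothetical names, not part of any provider API, and the rates are simply copied from the table on this page. It assumes billing is linear in token count, with no minimums or rounding.

```python
from decimal import Decimal

# USD per one million tokens as (input, output), copied from the provider table.
# The keys are illustrative labels, not official API identifiers.
RATES_PER_MILLION = {
    "mancer": (Decimal("0.20"), Decimal("1.25")),
    "featherless": (Decimal("0.80"), Decimal("1.20")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming purely linear pricing."""
    input_rate, output_rate = RATES_PER_MILLION[provider]
    return (input_rate * input_tokens + output_rate * output_tokens) / MILLION


if __name__ == "__main__":
    # Example: a 20,000-token prompt with a 1,000-token reply on Mancer.
    print(estimate_cost("mancer", 20_000, 1_000))
    # The per-token figure is the per-million rate divided by 1,000,000.
    print(RATES_PER_MILLION["mancer"][0] / MILLION)
```

`Decimal` is used rather than `float` so that very small per-token amounts do not pick up binary rounding noise. Actual billing may differ from this estimate.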
