NeverSleep: Llama 3 Lumimaid 8B (extended)
Llama3
Input: text
Output: text
Released: May 4, 2024 • Updated: Mar 28, 2025
The NeverSleep team is back, with a Llama 3 8B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necessary.
To improve its overall intelligence and chat capability, roughly 40% of the training data was non-roleplay material. This gives the model a broader base of knowledge to draw on while keeping roleplay as its primary strength.
Usage of this model is subject to Meta's Acceptable Use Policy.
24,576 Token Context
Process and analyze large documents and conversations.
Advanced Coding
Improved capabilities in front-end development and full-stack updates.
Agentic Workflows
Autonomously navigate multi-step processes with improved reliability.
Available On
Provider | Model ID | Context | Max Output | Input Cost | Output Cost | Throughput | Latency |
---|---|---|---|---|---|---|---|
Mancer (private) | mancer (private) | 25K | 2K | $0.20/M | $1.25/M | 64.5 t/s | 630 ms |
Featherless | featherless | 8K | 4K | $0.80/M | $1.20/M | 25.7 t/s | 1159 ms |
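As a rough illustration of how the per-million rates in the table translate into a per-request cost, here is a minimal Python sketch. The `ProviderRate` class, the `estimate_cost` helper, and the example token counts are hypothetical names and values introduced for illustration; only the prices are taken from the table, and actual billing may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderRate:
    """Hypothetical container for one row of the pricing table."""
    name: str
    input_usd_per_million: float   # "Input Cost" column, USD per 1M tokens
    output_usd_per_million: float  # "Output Cost" column, USD per 1M tokens


# Rates copied from the table above.
RATES = [
    ProviderRate("Mancer (private)", 0.20, 1.25),
    ProviderRate("Featherless", 0.80, 1.20),
]


def estimate_cost(rate: ProviderRate, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    total_micro_usd = (
        input_tokens * rate.input_usd_per_million
        + output_tokens * rate.output_usd_per_million
    )
    return total_micro_usd / 1_000_000


if __name__ == "__main__":
    # Assumed example request: 6,000 prompt tokens, 500 completion tokens.
    for rate in RATES:
        cost = estimate_cost(rate, input_tokens=6_000, output_tokens=500)
        print(f"{rate.name}: ${cost:.6f}")
```

Because the input rates differ by a factor of four while the output rates are nearly equal, prompt-heavy requests are where the choice of provider affects cost the most.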
Standard Pricing
Input Tokens
$0.0000002
per token
Output Tokens
$0.00000125
per token