NeverSleep: Llama 3 Lumimaid 8B

Llama3

Input: text

Output: text

Released: May 4, 2024•Updated: Mar 28, 2025

The NeverSleep team is back, with a Llama 3 8B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necessary.

To enhance it's overall intelligence and chat capability, roughly 40% of the training data was not roleplay. This provides a breadth of knowledge to access, while still keeping roleplay as the primary strength.

Usage of this model is subject to Meta's Acceptable Use Policy.

24,576 Token Context

Process and analyze large documents and conversations.

Advanced Coding

Improved capabilities in front-end development and full-stack updates.

Agentic Workflows

Autonomously navigate multi-step processes with improved reliability.

Available On

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
Mancer 2	mancer (private)	25K	2K	$0.20/M	$1.25/M	32.8 t/s	567 ms
Featherless	featherless	8K	4K	$0.80/M	$1.20/M	25.7 t/s	1225 ms

Standard Pricing

Input Tokens

per 1M tokens

$0.20

Output Tokens

per 1M tokens

$1.25

Do Work. With AI.

Join Waitlist Learn more