Do Services-as-Software

Llama3.1-Typhoon2-8B-Instruct is a Thai-English instruction-tuned model with 8 billion parameters, built on Llama 3.1. It significantly improves over its base model in Thai reasoning, instruction-following, and function-calling tasks, while maintaining competitive English performance. The model is optimized for bilingual interaction and performs well on Thai-English code-switching, MT-Bench, IFEval, and tool-use benchmarks.

Despite its smaller size, it demonstrates strong generalization across math, coding, and multilingual benchmarks, outperforming comparable 8B models across most Thai-specific tasks. Full benchmark results and methodology are available in the technical report.

Provider	Model ID	Context	Max Output	Input Cost	Output Cost	Throughput	Latency
Together	together	8K	-	$0.18/M	$0.18/M	73.3 t/s	270 ms

Provider

Model ID

Context

Max Output

Input Cost

Output Cost

Throughput

Latency

Together

together

$0.18/M

73.3 t/s

270 ms

Typhoon2 8B Instruct

8,192 Token Context

Advanced Coding

Agentic Workflows

Available On

Do Work. With AI.