Skip to content

Mistral Nemo 12B

Apache 2.0

Mistral AI · 12B · transformer-decoder

2024-07-18 131K context 12B params

Use Cases

chat code reasoning multilingual tools summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec49.5 GBGood
Q8_0816.0 GBGood
F161628.0 GBExcellent

About this model

Mistral Nemo 12B was built jointly by Mistral AI and NVIDIA. It features a 128K context window and uses a Tekken tokenizer that's more efficient across languages than prior Mistral models. With 3.4M+ Ollama pulls, it's one of the most popular models at its size. At Q4 it fits on 12 GB GPUs comfortably, making it a strong contender alongside Gemma 3 12B. Excellent at function calling, multilingual tasks, and general instruction following.

Benchmarks

68.0
mmlu