Skip to content

Google · 12B · transformer-decoder

2025-03-12 131K context 12B params

Use Cases

chat code reasoning multilingual vision math summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec410.5 GBGood
Q8_0816.0 GBExcellent
F161628.0 GBExcellent

About this model

Gemma 3 12B is the sweet spot of the Gemma 3 family — multimodal, 128K context, and strong enough to compete with models twice its size. It's one of the most popular models on Ollama with tens of millions of pulls. At Q4, it fits comfortably on 12-16 GB GPUs and delivers excellent results for conversation, coding, reasoning, and image understanding. A strong all-rounder for anyone with mid-range hardware.

Benchmarks

76.0
mmlu