Skip to content

Google · 1B · transformer-decoder

2025-03-12 33K context 1B params

Use Cases

chat multilingual summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_M41.5 GBModerate
Q8_0rec82.0 GBGood
F16163.5 GBExcellent

About this model

Gemma 3 1B is Google's ultra-lightweight model, ideal for edge devices and resource-constrained environments. Text-only (no vision at this size), it handles basic text generation and summarization tasks with minimal hardware requirements. At under 2 GB of VRAM for Q8, this model runs on virtually any modern hardware including older GPUs and base-config Macs.

Benchmarks

42.0
mmlu