Gemma 3 1B

Name: Gemma 3 1B
Author: Google

Gemma Terms of Use

Google · 1B · transformer-decoder

🤗 HuggingFace Ollama Official

2025-03-12 33K context 1B params

Use Cases

chat multilingual summary

Quantization Options

Quant	Bits	VRAM	Quality	Status
Q4_K_M	4	1.5 GB	Moderate	—
Q8_0rec	8	2.0 GB	Good	—
F16	16	3.5 GB	Excellent	—

About this model

Gemma 3 1B is Google's ultra-lightweight model, ideal for edge devices and resource-constrained environments. Text-only (no vision at this size), it handles basic text generation and summarization tasks with minimal hardware requirements. At under 2 GB of VRAM for Q8, this model runs on virtually any modern hardware including older GPUs and base-config Macs.

Benchmarks

42.0

mmlu

Your Hardware

DevicePick…

VRAM—

Bandwidth—

Detecting…

Install

Ollama

ollama run gemma3:1b-it-q8_0

llama.cpp / GGUF

Download GGUF from HuggingFace

Specs

Parameters: 1B
Architecture: transformer-decoder
Context: 33K tokens
Min VRAM: 1.5 GB
Recommended: 2.0 GB
Family: Gemma 3
Released: 2025-03-12
License: Gemma Terms of Use