Skip to content

Google · 4B · transformer-decoder

2025-03-12 131K context 4B params

Use Cases

chat code reasoning multilingual vision summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec45.0 GBModerate
Q8_087.5 GBGood
F161611.5 GBExcellent

About this model

Gemma 3 4B is the smallest multimodal model in the Gemma 3 family, supporting both text and image inputs. It delivers impressive performance for its size, outperforming many larger text-only models on standard benchmarks. With 128K context and vision support at just 5 GB VRAM (Q4), it's an excellent choice for users with 8 GB GPUs who want multimodal capabilities. Drag images into Ollama's desktop app to ask questions about them.

Benchmarks

62.0
mmlu