Skip to content

Qwen 3 4B

Apache 2.0

Alibaba · 4B · transformer-decoder

2025-04-29 131K context 4B params

Use Cases

chat code reasoning multilingual math summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec44.5 GBModerate
Q8_086.5 GBGood
F161611.0 GBExcellent

About this model

Qwen 3 4B is a compact dense model with hybrid thinking mode — it can answer directly for simple questions or engage step-by-step reasoning for complex tasks. Supports 29+ languages and 128K context. At Q4 it fits easily on any 8 GB GPU or Mac, making it an excellent lightweight daily driver. Punches well above its weight on reasoning and math benchmarks compared to similarly sized models.

Benchmarks

65.0
mmlu