chat code reasoning multilingual math tools summary
Quantization Options
Quant
Bits
VRAM
Quality
Status
Q4_K_Mrec
4
7.5 GB
Good
—
Q8_0
8
11.5 GB
Good
—
F16
16
20.0 GB
Excellent
—
About this model
Qwen 3 8B is the workhorse of the Qwen 3 dense lineup, offering an excellent balance of capability and resource efficiency. Features hybrid thinking mode for adaptive reasoning depth and supports tool calling for agentic workflows.
At Q4 it fits on 8 GB GPUs with some headroom, and runs comfortably on 12-16 GB hardware. Strong at coding, math, and multilingual tasks — a direct upgrade over Llama 3.1 8B in most benchmarks.