Tags: chat, code, reasoning, multilingual, math, tools, writing, summary
Quantization Options
Quant                  Bits   VRAM      Quality     Status
Q4_K_M (recommended)   4      23.0 GB   Good        —
Q8_0                   8      39.0 GB   Excellent   —
F16                    16     70.0 GB   Excellent   —
About this model
Qwen 3 32B is the largest dense model in the Qwen 3 family, delivering near-frontier performance across coding, math, reasoning, and creative writing. Hybrid thinking mode allows it to compete with 70B-class models on complex tasks.
At Q4_K_M it needs about 23 GB of VRAM, so it fits on an RTX 3090/5090 or a Mac with 24 GB+ unified memory. An excellent choice for users with high-end hardware who want the best dense model experience.
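As a rough illustration of the sizing logic above, the sketch below picks the highest-precision quantization whose listed VRAM figure fits a given memory budget. The VRAM numbers are copied from the Quantization Options table; the function name `best_quant` and the `headroom_gb` parameter are hypothetical, introduced here only for illustration, and real usage will also depend on context length and whatever else shares the memory.

```python
# VRAM figures (GB) copied from the Quantization Options table.
# Each entry is (quant name, bits, listed VRAM in GB).
QUANTS = [
    ("Q4_K_M", 4, 23.0),
    ("Q8_0", 8, 39.0),
    ("F16", 16, 70.0),
]


def best_quant(available_gb, headroom_gb=0.0):
    """Return the highest-bit quant whose listed VRAM fits the budget, or None.

    headroom_gb is a hypothetical safety margin (an assumption, not a figure
    from the model card) reserved for other processes or the OS.
    """
    budget = available_gb - headroom_gb
    fitting = [q for q in QUANTS if q[2] <= budget]
    if not fitting:
        return None
    # Prefer the most bits among the quants that fit.
    return max(fitting, key=lambda q: q[1])[0]


if __name__ == "__main__":
    for gb in (16, 24, 48, 80):
        print(f"{gb} GB -> {best_quant(gb)}")
```

With the table's numbers, a 24 GB budget selects Q4_K_M only when no headroom is reserved, which shows how tight the fit is on a 24 GB card.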