Skip to content

Qwen 3 32B

Apache 2.0

Alibaba · 32B · transformer-decoder

2025-04-29 131K context 32B params

Use Cases

chat code reasoning multilingual math tools writing summary

Quantization Options

QuantBitsVRAMQualityStatus
Q4_K_Mrec423.0 GBGood
Q8_0839.0 GBExcellent
F161670.0 GBExcellent

About this model

Qwen 3 32B is the largest dense model in the Qwen 3 family, delivering near-frontier performance across coding, math, reasoning, and creative writing. Hybrid thinking mode allows it to compete with 70B-class models on complex tasks. At Q4 it needs about 23 GB VRAM — fits on a RTX 3090/5090 or Macs with 24 GB+ unified memory. An excellent choice for users with high-end hardware who want the best dense model experience.

Benchmarks

84.0
mmlu