Granite 3.3 8B

Apache 2.0

IBM · 8B · transformer-decoder

2025-04-16 · 128K context · 8B params

Use Cases

chat · code · reasoning · math · tools · summary

Quantization Options

Quant        Bits  VRAM     Quality    Status
Q4_K_M       4     6.0 GB   Moderate
Q8_0 (rec)   8     10.0 GB  Good
F16          16    18.0 GB  Excellent
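
As a rough sanity check on the VRAM figures listed above, the sketch below computes a weights-only estimate (parameters × bits per weight ÷ 8) and compares it with each listed value. It assumes exactly 8e9 parameters and the nominal bit widths from the table; real quantization formats such as Q4_K_M store extra metadata, so the effective bits per weight may differ. The leftover amount is simply whatever the listed figure includes beyond the weights (for example KV cache and runtime buffers), which the page does not break down.

```python
# Weights-only VRAM estimate versus the figures listed in the table.
# Assumption: 8e9 parameters ("8B params") and nominal bits per weight.

PARAMS = 8e9

# (quant name, nominal bits per weight, VRAM listed on the page in GB)
TABLE = [
    ("Q4_K_M", 4, 6.0),
    ("Q8_0", 8, 10.0),
    ("F16", 16, 18.0),
]


def weights_gb(params: float, bits: int) -> float:
    """Memory for the weights alone, in decimal gigabytes."""
    return params * bits / 8 / 1e9


def estimate_rows():
    """Return (name, weights-only GB, listed GB minus weights-only GB)."""
    rows = []
    for name, bits, listed_gb in TABLE:
        w = weights_gb(PARAMS, bits)
        rows.append((name, w, listed_gb - w))
    return rows


if __name__ == "__main__":
    for name, w, extra in estimate_rows():
        print(f"{name}: weights ~{w:.1f} GB, remainder ~{extra:.1f} GB")
```

Under these assumptions each listed figure comes out at the weights-only estimate plus about 2 GB. That is an observation about the table, not a documented formula.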

About this model

Granite 3.3 8B is IBM's enterprise-focused language model built on a dense transformer decoder architecture with GQA, RoPE, and SwiGLU. It supports a 128K context window and features structured reasoning through dedicated thinking and response tags. The model delivers strong results on AlpacaEval-2.0 and Arena-Hard benchmarks, with particular strengths in mathematics, coding, and instruction adherence. Released under the Apache 2.0 license, it is well-suited for enterprise RAG applications and business use cases.
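
The thinking and response tags mentioned above make it possible to separate the model's reasoning from its final answer after generation. The following is a minimal sketch, assuming the tags are spelled `<think>...</think>` and `<response>...</response>`; this page does not give the tag names, so both are parameters here, and `split_reasoning` is a hypothetical helper, not part of any Granite tooling.

```python
import re


def split_reasoning(text, think_tag="think", response_tag="response"):
    """Split raw model output into (thinking, answer).

    Tag names are assumptions and can be overridden. If no response
    tag is found, the answer falls back to the text with any thinking
    block removed.
    """

    def block_pattern(tag):
        # Non-greedy match so only the first block of each kind is taken.
        return rf"<{tag}>(.*?)</{tag}>"

    def grab(tag):
        m = re.search(block_pattern(tag), text, flags=re.DOTALL)
        return m.group(1).strip() if m else None

    thinking = grab(think_tag)
    answer = grab(response_tag)
    if answer is None:
        stripped = re.sub(block_pattern(think_tag), "", text, flags=re.DOTALL)
        answer = stripped.strip()
    return thinking, answer


if __name__ == "__main__":
    raw = "<think>2 + 2 is 4.</think><response>The answer is 4.</response>"
    thinking, answer = split_reasoning(raw)
    print("thinking:", thinking)
    print("answer:", answer)
```

Keeping the tag names as parameters means the same helper still works if a given chat template uses different delimiters.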