Mixtral 8x7B
by Mistral AI · mistral family
47B
parameters
text-generation code-generation reasoning multilingual math creative-writing summarization
Mixtral 8x7B is Mistral AI's mixture-of-experts model, utilizing eight expert networks of 7B parameters each with a routing mechanism that activates two experts per token. This architecture gives it 47B total parameters but only uses about 13B during inference, providing excellent efficiency. The model delivers performance competitive with much larger dense models while maintaining faster inference speeds. It excels at reasoning, multilingual tasks, and code generation, and is particularly well-suited for users who need high-quality output with reasonable hardware requirements.
Quick Start with Ollama
ollama run 8x7b-instruct-v0.1-q4_K_M | Creator | Mistral AI |
| Parameters | 47B |
| Architecture | transformer-decoder |
| Context Length | 32K tokens |
| License | Apache 2.0 |
| Released | Dec 11, 2023 |
| Ollama | mixtral |
Quantization Options
| Format | File Size | VRAM Required | Quality | Ollama Tag |
|---|---|---|---|---|
| Q4_K_M recommended | 22.6 GB | 29.7 GB |
★
★
★
★
★
| 8x7b-instruct-v0.1-q4_K_M |
| Q5_K_M | 26.3 GB | 34.4 GB |
★
★
★
★
★
| 8x7b-instruct-v0.1-q5_K_M |
| Q8_0 | 44.1 GB | 49 GB |
★
★
★
★
★
| 8x7b-instruct-v0.1-q8_0 |
Compatible Hardware for Q4_K_M
Showing compatibility for the recommended quantization (Q4_K_M, 29.7 GB VRAM).
Compatible Hardware
| Hardware | VRAM | Type | Fit |
|---|---|---|---|
| Mac Pro M2 Ultra 192GB | 192 GB | mac | Runs |
| Mac Studio M4 Ultra 192GB | 192 GB | mac | Runs |
| Mac Studio M4 Max 128GB | 128 GB | mac | Runs |
| MacBook Pro M4 Max 128GB | 128 GB | mac | Runs |
| Mac Studio M4 Max 64GB | 64 GB | mac | Runs |
| MacBook Pro M4 Max 64GB | 64 GB | mac | Runs |
| Mac mini M4 Pro 48GB | 48 GB | mac | Runs |
| MacBook Pro M4 Max 48GB | 48 GB | mac | Runs |
| MacBook Pro M4 Pro 48GB | 48 GB | mac | Runs |
| NVIDIA GeForce RTX 5090 | 32 GB | gpu | Runs (tight) |
| Mac mini M4 32GB | 32 GB | mac | Runs (tight) |
| AMD Radeon RX 7900 XTX | 24 GB | gpu | CPU Offload |
| NVIDIA GeForce RTX 3090 | 24 GB | gpu | CPU Offload |
| NVIDIA GeForce RTX 4090 | 24 GB | gpu | CPU Offload |
| Mac mini M4 Pro 24GB | 24 GB | mac | CPU Offload |
| MacBook Air M4 24GB | 24 GB | mac | CPU Offload |
| MacBook Pro M4 Pro 24GB | 24 GB | mac | CPU Offload |
| AMD Radeon RX 7900 XT | 20 GB | gpu | CPU Offload |
18 hardware
device(s) cannot run this model configuration.
Benchmark Scores
70.6
mmlu