Stability AI

Stable Diffusion 3 Medium (GGUF)

SD 3 with MMDiT architecture. Superior text rendering.

2.5B parametersmmdit-diffusionstability-community9.15GB - 9.15GB VRAM

About This Model

Stable Diffusion 3 Medium (GGUF) by Stability AI is a 2.5 billion parameter text-to-image generation model designed for efficient local deployment. This model excels in generating high-quality images from textual descriptions, making it a solid choice for artists, designers, and hobbyists who need a reliable tool for creative projects without the need for cloud services. The mmdit-diffusion architecture ensures that the model can produce detailed and contextually relevant images, though the exact context length is unknown, which might affect longer or more complex prompts.

In its size class, Stable Diffusion 3 Medium (GGUF) punches well above its weight. It offers a good balance between performance and resource efficiency, making it a practical option for users with mid-range hardware. Despite having fewer parameters than larger models, it maintains a high level of image quality and detail, often rivaling the output of more resource-intensive models. The Q8_0 quantization further enhances its efficiency, requiring only 9.2 GB of VRAM, which is manageable for most modern GPUs. This makes it an excellent choice for users who want to leverage powerful AI capabilities without investing in top-tier hardware. Ideal users include those with GPUs like the RTX 2060 or higher, ensuring smooth and fast generation times.

Check Your Hardware

See which quantizations of Stable Diffusion 3 Medium (GGUF) your hardware can run.

Quantization Options

QuantizationBitsFile SizeVRAM NeededRAM NeededQuality
Q8_088.653 GB9.15 GB9.65 GB
95%

Try It — Diffusion Generation Demo

Click "Generate" to watch how Flux.1 creates an image from noise. Real outputs from RunThisModel.com.

A cozy wooden cabin in snowy mountains at golden hour sunset

"A cozy wooden cabin in snowy mountains at golden hour sunset"

A friendly humanoid robot reading a book in a library

"A friendly humanoid robot reading a book in a library"

Gourmet sushi platter, professional food photography

"Gourmet sushi platter, professional food photography"

Woman scientist in a modern lab, natural lighting

"Woman scientist in a modern lab, natural lighting"

Snow leopard on mountain peak at dawn, golden rim light

"Snow leopard on mountain peak at dawn, golden rim light"

Cyberpunk city at night, neon signs, rain reflections

"Cyberpunk city at night, neon signs, rain reflections"

Animation simulates the diffusion denoising process at recorded generation speed. Actual generation requires GPU hardware or cloud service.

Frequently Asked Questions

How much VRAM do I need to run Stable Diffusion 3 Medium (GGUF)?

Stable Diffusion 3 Medium (GGUF) requires 9.15GB VRAM minimum with Q8_0 quantization. For full precision, you need 9.15GB VRAM.

What is the best quantization for Stable Diffusion 3 Medium (GGUF)?

Q4_K_M offers the best balance of quality and VRAM usage. Q8_0 is near-lossless if you have enough VRAM.