SD-Turbo - Distilled Stable Diffusion 2.1 For Real-Time Synthesis (huggingface.co)

submitted 11 months ago by Even_Adder@lemmy.dbzer0.com to c/stable_diffusion@lemmy.dbzer0.com

5 comments fedilink hide all child comments

SD-Turbo is a fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation. We release SD-Turbo as a research artifact, and to study small, distilled text-to-image models. For increased quality and prompt understanding, we recommend SDXL-Turbo.

Model Description

SD-Turbo is a distilled version of Stable Diffusion 2.1, trained for real-time synthesis. SD-Turbo is based on a novel training method called Adversarial Diffusion Distillation (ADD) (see the technical report), which allows sampling large-scale foundational image diffusion models in 1 to 4 steps at high image quality. This approach uses score distillation to leverage large-scale off-the-shelf image diffusion models as a teacher signal and combines this with an adversarial loss to ensure high image fidelity even in the low-step regime of one or two sampling steps.

you are viewing a single comment's thread
view the rest of the comments

[-] FormallyKnown@feddit.dk 3 points 11 months ago

If this really can do real time synthesis, then it opens up a whole new world of possibility. Thought we had to wait years for this

this post was submitted on 01 Dec 2023

13 points (100.0% liked)

Stable Diffusion

4297 readers

1 users here now

Discuss matters related to our favourite AI Art generation technology

Also see

Other communities

founded 1 year ago

MODERATORS

db0@lemmy.dbzer0.com