🚀 Introducing SGLang Diffusion — bringing SGLang’s high-performance serving to diffusion models.
⚡️ Up to 5.9× faster inference
🧩 Supports major open-source models: Wan, Hunyuan, Qwen-Image, Qwen-Image-Edit, Flux
🧰 Easy to use via OpenAI-compatible API, CLI & Python API
Built with FastVideo to power the full diffusion ecosystem, and special thanks to
@NVIDIAAIDev and
@VoltagePark for their compute support!
⬇️Read more in the thread: