Diffusion Explainer is an interactive visualization tool designed to help anyone learn how Stable Diffusion transforms text prompts into images. It runs in your browser, allowing you to experiment with several preset prompts without any installation, coding skills, or GPUs. Try Diffusion Explainer at https://poloclub.github.io/diffusion-explainer and watch a demo video on YouTube https://youtu.be/Zg4gxdIWDds!
diffusion-explainer.mp4 |
|||
🚀 Live Demo | 📺 Demo Video | 📜 Research Paper | 📄 Blog Post |
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion. Seongmin Lee, Benjamin Hoover, Hendrik Strobelt, Zijie J. Wang, ShengYun Peng, Austin Wright, Kevin Li, Haekyu Park, Haoyang Yang, Duen Horng Chau. Short paper, IEEE VIS 2024.
git clone https://github.com/poloclub/diffusion-explainer.git
cd diffusion-explainer
python -m http.server 8000
Then, on your web browser, access http://localhost:8000. You can replace 8000 with other port numbers you want to use.
Led by Seongmin Lee, Diffusion Explainer is created by Machine Learning and Human-computer Interaction researchers at Georgia Tech and IBM Research. The team includes Seongmin Lee, Benjamin Hoover, Hendrik Strobelt, Jay Wang, ShengYun (Anthony) Peng, Austin Wright, Kevin Li, Haekyu Park, Alex Yang, and Polo Chau.
@article{lee2024diffusion,
title = {{D}iffusion {E}xplainer: {V}isual {E}xplanation for {T}ext-to-image {S}table {D}iffusion},
shorttitle = {Diffusion Explainer},
author = {Lee, Seongmin and Hoover, Benjamin and Strobelt, Hendrik and Wang, Zijie J and Peng, ShengYun and Wright, Austin and Li, Kevin and Park, Haekyu and Yang, Haoyang and Chau, Duen Horng},
journal={IEEE VIS},
year={2024}
}
The software is available under the MIT License.
If you have any questions, feel free to open an issue or contact Seongmin Lee.