3D Unet / Temporal Attention implementation Diffusion Forcing

[Diffusion Forcing Website] [Original Implementation]

This is a 3D-Unet implementation of paper Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion.

This repo is coded by Kiwhan Song, an amazing MIT undergrad working with Boyuan Chen and Vincent Sitzmann based on Boyuan's research template repo.

The content is not used in the original Diffusion Forcing paper but a reimplementation with better architecture for video generation. Original Diffusion Forcing code is RNN based to optimize for sequential decision making, while this repo uses Lucidrain's 3DUnet/Attention optimized for video.

This repo was originally part of our follow up project but we decided to release it early due to popularity of Diffusion Forcing among Generative AI community. Right now auto-regressive sampling with this repo is expected to be slow, since we haven't implemented causal attention caching. We've already verified diffusion forcing works in latent diffusion and can be extended to many more tokens without sacrificing compositionality with some special techniques, although those code will not be released immediately!

Project Instructions

** Update Aug 2024 ** This repo has been merged into the main [Diffusion Forcing Implementation] with version number v1.5, please directly use that instead and follow the instruction there.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

3D Unet / Temporal Attention implementation Diffusion Forcing

[Diffusion Forcing Website] [Original Implementation]

Project Instructions

About

Releases

Packages

Contributors 2

License

kwsong0113/diffusion-forcing-transformer

Folders and files

Latest commit

History

Repository files navigation

3D Unet / Temporal Attention implementation Diffusion Forcing

[Diffusion Forcing Website] [Original Implementation]

Project Instructions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages