This script attempts to reproduce the Midjourney Remix feature.
By default, the script produces remixed samples in the current directory. Note that this method requires the upstream version of the diffusers library.
```bash
python run.py /path/to/content_image.png /path/to/style_image.png
```
Here is a brief description of the final method. For research details, please refer to the `research` directory.
- The Stable Diffusion v2-1-unclip model is used, as it allows guiding reverse diffusion with CLIP image embeddings instead of text embeddings.
- The content image is forward-diffused to the specified `timestamp` and used as the initial latent vector.
- Both the content and style images are encoded with the CLIP model, and their embeddings are averaged with the `alpha` parameter.
- The reverse diffusion process is run with the initial latent vector of the content image and the averaged CLIP embeddings as guidance.
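
Assuming the script builds on the upstream `StableUnCLIPImg2ImgPipeline` from diffusers, the embedding-blending step could look roughly like the sketch below. The helper name `clip_embed`, the output filename, and the GPU/fp16 setup are illustrative assumptions, not taken from `run.py`.

```python
import torch
from PIL import Image
from diffusers import StableUnCLIPImg2ImgPipeline

# Load the unCLIP variant of Stable Diffusion 2.1 (assumes a CUDA device).
pipe = StableUnCLIPImg2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1-unclip", torch_dtype=torch.float16
).to("cuda")

def clip_embed(image: Image.Image) -> torch.Tensor:
    """Encode a PIL image into a CLIP image embedding with the pipeline's own encoder."""
    pixels = pipe.feature_extractor(images=image, return_tensors="pt").pixel_values
    pixels = pixels.to(device="cuda", dtype=torch.float16)
    return pipe.image_encoder(pixels).image_embeds

content = Image.open("content_image.png").convert("RGB")
style = Image.open("style_image.png").convert("RGB")

alpha = 0.4  # 0.0 -> pure content guidance, 1.0 -> pure style guidance
image_embeds = (1.0 - alpha) * clip_embed(content) + alpha * clip_embed(style)

# Guide reverse diffusion with the blended CLIP image embedding.
remix = pipe(image_embeds=image_embeds, num_inference_steps=30).images[0]
remix.save("remixed.png")
```

Note that this sketch starts the reverse diffusion from pure noise; the initial-latent trick driven by `timestamp` is sketched after the hyperparameter list below.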
The most important hyperparameters are:

- `alpha`: determines how much the style image affects the diffusion process.
- `timestamp`: determines how far the content image is diffused before being used as the initial latent vector.
- `num_inference_steps`: determines how many steps of the reverse diffusion process are run.
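
For `timestamp` specifically, the forward-diffusion step can be sketched as below, reusing the `pipe` object from the previous sketch. `content_latents` is a hypothetical helper, not a function from `run.py`; the stock pipeline call does not expose starting the denoising loop at an intermediate timestep, so the script presumably handles that part in its own loop.

```python
import numpy as np
import torch

@torch.no_grad()
def content_latents(pipe, content, timestamp: int) -> torch.Tensor:
    """VAE-encode the content image and forward-diffuse it to `timestamp`."""
    # Preprocess to the 768x768 resolution of the v2-1-unclip model, scaled to [-1, 1].
    arr = np.asarray(content.resize((768, 768))).astype(np.float32) / 255.0
    pixels = torch.from_numpy(arr).permute(2, 0, 1).unsqueeze(0) * 2.0 - 1.0
    pixels = pixels.to(device=pipe.device, dtype=pipe.vae.dtype)

    # Encode into the latent space and apply the VAE scaling factor.
    latents = pipe.vae.encode(pixels).latent_dist.sample()
    latents = latents * pipe.vae.config.scaling_factor

    # Forward diffusion q(x_t | x_0): a larger `timestamp` adds more noise,
    # hence the content image constrains the result less.
    noise = torch.randn_like(latents)
    timesteps = torch.tensor([timestamp], device=latents.device)
    return pipe.scheduler.add_noise(latents, noise, timesteps)
```

The resulting latents would replace the random initial latents, and the reverse diffusion would then run from `timestamp` down to zero under the blended CLIP embedding guidance described above.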