Diffusion Image Analogies

Adéla Šubrtová, Michal Lukáč, Jan Čech, David Futschik, Eli Shechtman, Daniel Sýkora

This is the official repository for the paper Diffusion Image Analogies, published in the ACM SIGGRAPH 2023 Conference Proceedings.

Installation

Clone the repo

git clone --recurse-submodules https://github.com/subrtadel/DIA.git
cd ./DIA

Create environment

conda create -n dia_env
conda activate dia_env
conda install python=3.8.5 pip=20.3 cudatoolkit=11.3 pytorch=1.11.0 torchvision=0.12.0 numpy=1.19.2 -c pytorch -c nvidia -c conda-forge -c defaults
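Once the environment is created, it can help to confirm that the pinned versions were actually installed. The following is a minimal sketch, not part of the repository: it assumes the conda packages expose pip-style metadata under the distribution names `torch`, `torchvision`, and `numpy` (the conda package `pytorch` is normally visible as `torch`), and the helper name `check_pins` is hypothetical.

```python
from importlib import metadata  # available in Python 3.8+

# Versions copied from the conda install command above.
# Assumption: the distribution names below match what conda registers.
PINS = {
    "torch": "1.11.0",
    "torchvision": "0.12.0",
    "numpy": "1.19.2",
}


def check_pins(pins):
    """Return {name: (expected, installed_or_None, matches)} for each pin."""
    report = {}
    for name, expected in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # A local build tag such as "1.11.0+cu113" still counts as a match.
        matches = installed is not None and installed.split("+")[0] == expected
        report[name] = (expected, installed, matches)
    return report


if __name__ == "__main__":
    for name, (expected, installed, ok) in check_pins(PINS).items():
        status = "ok" if ok else "MISMATCH"
        print(f"{name}: expected {expected}, found {installed} [{status}]")
```

Run it inside the activated `dia_env` environment. A `MISMATCH` line only means the installed version differs from the pin; whether that matters for the scripts is not something this check can tell.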

Install packages

pip install -r requirements.txt
cd ./stable-diffusion/
pip install -e git+https://github.com/CompVis/taming-transformers.git@master#egg=taming-transformers
pip install -e .

Download the sd-v1-4.ckpt model and put it into the folder created below:
```
mkdir -p ./models/ldm/stable-diffusion-v1/
```
Install ImageMagick.
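The last two installation steps are easy to get wrong silently, so a quick check before running anything may save time. This is a sketch under stated assumptions: it looks for any `.ckpt` file in the folder created above (the exact filename the code expects is not stated here), it assumes it is run from the same directory as the `mkdir` command, and the function names are hypothetical.

```python
import shutil
from pathlib import Path


def find_checkpoints(model_dir):
    """Return the sorted .ckpt files found directly inside model_dir."""
    model_dir = Path(model_dir)
    if not model_dir.is_dir():
        return []
    return sorted(p for p in model_dir.iterdir() if p.suffix == ".ckpt")


def find_imagemagick():
    """Return the path of an ImageMagick executable on PATH, or None."""
    # ImageMagick 7 ships "magick"; ImageMagick 6 ships "convert".
    # Note: on Windows, "convert" may resolve to an unrelated system tool.
    for name in ("magick", "convert"):
        path = shutil.which(name)
        if path:
            return path
    return None


if __name__ == "__main__":
    # Assumption: run from the directory where the mkdir command was run.
    ckpts = find_checkpoints("./models/ldm/stable-diffusion-v1/")
    print("checkpoints:", [p.name for p in ckpts] or "none found")
    print("ImageMagick:", find_imagemagick() or "not found on PATH")
```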


Usage

Place your images in the ./dataset/raw_data/ folder.

Run process_new_data.py. Each image is assigned a file_id in %05d format, i.e. a five-digit, zero-padded number such as 00042.

Define the triplets in a .csv file, referring to the images by their file_id. An example file is triplets.csv. The first column specifies the A input, the second the A' input, and the third the B input. The file_ids may be given with or without filename suffixes.
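For more than a handful of analogies, the triplet file can be generated rather than typed by hand. The sketch below is illustrative only: it assumes a comma-separated file with no header row and one triplet per line (compare against the bundled triplets.csv before relying on it), and the helper names and indices are hypothetical.

```python
import csv
import sys


def file_id(index):
    """Format an integer as a zero-padded file_id, e.g. 7 -> '00007'."""
    return "%05d" % index


def triplet_rows(triplets):
    """Turn (A, A', B) integer triplets into rows of file_id strings."""
    return [[file_id(i) for i in triplet] for triplet in triplets]


def write_triplets(out, triplets):
    """Write one CSV row per triplet to an open text file object."""
    # Assumption: comma delimiter, no header row.
    csv.writer(out).writerows(triplet_rows(triplets))


if __name__ == "__main__":
    # Hypothetical indices; use the file_ids assigned by process_new_data.py.
    write_triplets(sys.stdout, [(0, 1, 2), (3, 4, 5)])
```

To write a file instead of printing, pass a handle opened with `open("my_triplets.csv", "w", newline="")` and point `--triplet_file` at it.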

Run the precompute_noises_and_conditionings.py script. This may take a while.

python precompute_noises_and_conditionings.py \
    --config ./config/parameter_estimation.yaml \
    --inversion_subfolder noise \
    --token_subfolder tokens \
    --triplet_file triplets.csv \
    --data_path ./dataset/data/

Review the parameters in ./config/analogy_params.yaml and adjust them if needed.

Run the do_analogies.py script.

python do_analogies.py \
    --config ./config/parameter_estimation.yaml \
    --inversion_subfolder noise \
    --token_subfolder tokens \
    --output_subfolder analogies \
    --triplet_file triplets.csv \
    --data_path ./dataset/data/
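The two steps share most of their arguments, so they can be driven from one small wrapper. This is a sketch, not a script shipped with the repository: the flags and values are copied from the two commands above, while the function names and the `dry_run` switch are hypothetical. Passing the arguments as a list avoids shell line-continuation issues altogether.

```python
import shlex
import subprocess
import sys

# Flag values copied from the two commands above.
SHARED = {
    "--config": "./config/parameter_estimation.yaml",
    "--inversion_subfolder": "noise",
    "--token_subfolder": "tokens",
    "--triplet_file": "triplets.csv",
    "--data_path": "./dataset/data/",
}


def build_command(script, options):
    """Build an argv list: the current interpreter, the script, then flags."""
    cmd = [sys.executable, script]
    for flag, value in options.items():
        cmd += [flag, value]
    return cmd


def pipeline_commands():
    """Return the precompute command followed by the analogy command."""
    precompute = build_command("precompute_noises_and_conditionings.py", SHARED)
    analogies = build_command(
        "do_analogies.py", {**SHARED, "--output_subfolder": "analogies"}
    )
    return [precompute, analogies]


def run_pipeline(dry_run=True):
    """Print each command; execute it too when dry_run is False."""
    for cmd in pipeline_commands():
        print(" ".join(shlex.quote(part) for part in cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # stop at the first failure


if __name__ == "__main__":
    run_pipeline(dry_run=True)
```

By default the wrapper only prints the commands. Call `run_pipeline(dry_run=False)` from the repository root to execute them in order.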

BibTeX

@inproceedings{Subrtova2023DIA,
    title = {Diffusion Image Analogies},
    author = {A. \v{S}ubrtov\'{a} and M. Luk\'{a}\v{c} and J. \v{C}ech and D. Futschik and E. Shechtman and D. S\'{y}kora},
    booktitle = {ACM SIGGRAPH 2023 Conference Proceedings},
    year = {2023}
}
