StyleGAN2

StyleGAN2 is a GAN architecture that provides state-of-the-art results in data-driven unconditional generative image modeling. This implementation is based on a research paper published by NVIDIA in 2020.

What is a GAN

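A GAN (generative adversarial network) consists of two neural networks that compete against each other in a game: a generator that produces new samples, and a discriminator that tries to tell those samples apart from the training data. Given a training set, this technique learns to generate new data with the same statistics as the training set. Learn more about GANs here.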
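To make the "game" concrete, here is a minimal sketch of the two competing objectives, written with NumPy. It uses the common non-saturating logistic GAN loss as an assumption; the function names are hypothetical and this is not necessarily the loss used in this repository's training loop.

```python
import numpy as np

def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return np.logaddexp(0.0, x)

def discriminator_loss(real_logits, fake_logits):
    # The discriminator is rewarded for high logits on real images
    # and low logits on generated (fake) images.
    return float(np.mean(softplus(-real_logits)) + np.mean(softplus(fake_logits)))

def generator_loss(fake_logits):
    # Non-saturating form: the generator is rewarded when the
    # discriminator assigns high logits to its fake images.
    return float(np.mean(softplus(-fake_logits)))

# Illustrative logits only; in practice these come from the discriminator network.
confident_d = discriminator_loss(np.array([5.0]), np.array([-5.0]))
undecided_d = discriminator_loss(np.array([0.0]), np.array([0.0]))
```

The two losses pull in opposite directions on the same fake logits, which is what makes training a contest rather than a single optimization problem.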

About us

We are a project group within the student organization Cogito at NTNU (Norwegian University of Science and Technology). The group worked on this project throughout the fall of 2020.

Motivation

We wanted to get a better understanding of the hottest image generation technology at the time, and NVIDIA had published extremely good results in a paper just months before. We were also unable to find anyone else who had tried to recreate StyleGAN2, so we saw it as a good challenge to take on.

Results

Early results were plagued by mode collapse, which means that two vastly different latent vectors produced almost exactly the same output image. We had more success after reducing the learning rate by a factor of a thousand.
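One way to spot the mode collapse described above is to compare how spread out a batch of generated images is relative to the latent vectors that produced them. The sketch below is an illustration under stated assumptions: the helper names, the latent size of 512, and the random stand-in "images" are all hypothetical and are not taken from this repository.

```python
import numpy as np

def mean_pairwise_distance(batch):
    """Average L2 distance between all distinct pairs of flattened samples."""
    flat = batch.reshape(len(batch), -1).astype(np.float64)
    diffs = flat[:, None, :] - flat[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    n = len(flat)
    # The diagonal is zero (each sample compared with itself), so divide
    # by the number of off-diagonal entries only.
    return float(dists.sum() / (n * (n - 1)))

def diversity_ratio(latents, images):
    """Image spread relative to latent spread; values near zero hint at mode collapse."""
    return mean_pairwise_distance(images) / mean_pairwise_distance(latents)

rng = np.random.default_rng(0)
latents = rng.standard_normal((8, 512))  # assumed latent size
# Stand-in for a collapsed generator: the same image plus tiny noise.
collapsed = np.tile(rng.random((1, 16, 16, 3)), (8, 1, 1, 1)) + 1e-3 * rng.random((8, 16, 16, 3))
# Stand-in for a healthy generator: clearly different images.
healthy = rng.random((8, 16, 16, 3))

collapsed_ratio = diversity_ratio(latents, collapsed)
healthy_ratio = diversity_ratio(latents, healthy)
```

In a real run, the image batches would come from the generator, and a ratio that stays near zero across training would be a reason to try a lower learning rate.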

The image below shows a pygame application for easily creating new images, saving them, and making interpolation videos between them, similar to this.
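An interpolation video is typically produced by walking between two latent vectors and generating one frame per step. The following is a minimal sketch that assumes plain linear interpolation and a latent size of 512; the function name is hypothetical, and the repository's own interpolation code may work differently.

```python
import numpy as np

def interpolate_latents(z_start, z_end, num_frames):
    """Return num_frames latent vectors moving linearly from z_start to z_end."""
    # Column vector of blend weights from 0 (start) to 1 (end).
    t = np.linspace(0.0, 1.0, num_frames)[:, None]
    return (1.0 - t) * z_start + t * z_end

rng = np.random.default_rng(1)
z_a = rng.standard_normal(512)  # assumed latent size
z_b = rng.standard_normal(512)
frames = interpolate_latents(z_a, z_b, num_frames=5)
```

Each row of `frames` would then be passed through the generator to render one video frame.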

Improvements

The easiest change we could make would be to train the system for longer. NVIDIA ran their model for 51 GPU-years with Titan graphics cards, while we trained our models for a couple of days on modern graphics cards.

Another major improvement would be to spend a month's worth of training time homing in on the right learning rate, model size, and model shape before committing to a full training run.

We could also consider hosting an application on a website with different models to make our creation more usable.

Setup and how to train on your own

The current h5 files take up around 50 GB of space for 512x512 resolution, so instead of distributing them we describe the steps for training your own version.

1. Install Python. Versions 3.6 and 3.7 are known to work; use other Python versions at your own risk.
2. Run pip install -r requirements.txt in the project folder. This installs all the required libraries.
3. Download a dataset of images you want to train on. The images need to be square.
4. Change config.py to suit your dataset. At a minimum, you need to change IMG_SIZE and IMAGES_TO_CONVERT.
   IMG_SIZE is the resolution of both the x and the y axis of the images.
   IMAGES_TO_CONVERT is the folder with the training images you downloaded in the previous step.
5. Run prep_data.py.
6. Run training_loop.py. You can see how your network performs by looking in the generated_images folder. Your model weights are saved in the weights folder. Increase or decrease SAVE_INTERVAL in config.py if you want your network to be saved more or less often.
7. To use the app, first change MODEL_APP_WEIGHTS in config.py to the generator weights you are using, then run app.py.
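As a rough illustration of the settings mentioned in the steps above, the relevant part of config.py might look something like the sketch below. Only the four setting names come from this README; every value, path, and file name is a hypothetical placeholder, and the real config.py likely contains further settings.

```python
# Sketch of the config.py settings named in the setup steps.
# All values below are illustrative placeholders, not the project's defaults.

# Resolution of both the x and the y axis of the (square) training images.
IMG_SIZE = 256

# Folder containing the training images you downloaded (hypothetical path).
IMAGES_TO_CONVERT = "path/to/your/images"

# How often the network is saved during training; a smaller value saves
# more often. The unit is not stated in this README, so check the code.
SAVE_INTERVAL = 1000

# Generator weights loaded by app.py (hypothetical file name).
MODEL_APP_WEIGHTS = "weights/your_generator_weights.h5"
```

After editing these values, rerun prep_data.py so the prepared data matches the new IMG_SIZE and image folder.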

