VQGAN training details #61

Andrew-Brown1 · 2021-06-25T21:28:48Z

Hi,

Thanks for the great repo! Could I ask some questions about training VQGAN?

What batch size did you train it with, and for how long?
Also I see here that you wait until you add the discriminator loss https://www.youtube.com/watch?v=fy153-yXSQk

Do you wait until the model has converged without it before adding it?

Thanks!

rromb · 2021-09-28T16:07:45Z

Hi, great question :) Most of our published VQGAN models are trained on a single 40GB VRAM GPU with a batch size of ~12 (bs=14 for the f16 model), depending on the hyperparameters. Regarding your second question, yes, it makes sense to monitor the perceptual loss and then add the discriminator to the training loop.

Andrew-Brown1 · 2021-09-28T16:20:18Z

Hey - thanks!

gombru · 2022-10-20T11:14:41Z

Hi! Can we get any details about training costs? I.e. how many epochs and gpu days took to train the OpenImages model on a single 40GB VRAM GPU?

Thanks!

rromb closed this as completed Sep 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VQGAN training details #61

VQGAN training details #61

Andrew-Brown1 commented Jun 25, 2021

rromb commented Sep 28, 2021

Andrew-Brown1 commented Sep 28, 2021

gombru commented Oct 20, 2022

VQGAN training details #61

VQGAN training details #61

Comments

Andrew-Brown1 commented Jun 25, 2021

rromb commented Sep 28, 2021

Andrew-Brown1 commented Sep 28, 2021

gombru commented Oct 20, 2022