You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, great question :) Most of our published VQGAN models are trained on a single 40GB VRAM GPU with a batch size of ~12 (bs=14 for the f16 model), depending on the hyperparameters. Regarding your second question, yes, it makes sense to monitor the perceptual loss and then add the discriminator to the training loop.
Hi,
Thanks for the great repo! Could I ask some questions about training VQGAN?
What batch size did you train it with, and for how long?
Also I see here that you wait until you add the discriminator loss https://www.youtube.com/watch?v=fy153-yXSQk
Do you wait until the model has converged without it before adding it?
Thanks!
The text was updated successfully, but these errors were encountered: