How to decide the training epochs or early stop condition? #31

idealwhite · 2021-03-11T01:56:18Z

I really like your paper, thanks for your open source!
It seems that you did not use early stop in the ModelCheckpoint. Could you please tell me how many epochs you trained the VQGAN and transformer? Or do you have suggestions about the training epochs on new datasets?

rromb · 2021-03-29T13:36:08Z

Thanks!
The VQGAN benefits greatly from training it as long as possible (provided the data set is large enough and overfitting is a secondary concern), and tuning in the discriminator rather late. When training on ImageNet, for example, I would recommend 3-5epochs without the adversarial loss (but more is better) and then training for at least another 3-5 epochs with the discriminator turned on (again, more=better).

The stopping condition for the transformer is usually when it starts overfitting in terms of NLL on held-out test data.

sunshineatnoon · 2021-06-07T17:33:56Z

May I ask how long it usually takes to train on the ImageNet and how many GPUs are used?

gombru · 2022-10-20T11:26:13Z

Any updates on training times? Costs?

cucdengjunli · 2023-06-21T06:50:35Z

mark

idealwhite changed the title ~~Thanks for your open source! About training epochs and early stop~~ How to decide the training epochs and early stop condition? Mar 11, 2021

idealwhite changed the title ~~How to decide the training epochs and early stop condition?~~ How to decide the training epochs or early stop condition? Mar 11, 2021

bob80333 mentioned this issue May 16, 2021

KL divergent term in DiscreteVAE lucidrains/DALLE-pytorch#250

Open

hyakuchiki mentioned this issue Aug 7, 2021

Very confused by the discriminator loss #93

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to decide the training epochs or early stop condition? #31

How to decide the training epochs or early stop condition? #31

idealwhite commented Mar 11, 2021

rromb commented Mar 29, 2021

sunshineatnoon commented Jun 7, 2021 •

edited

Loading

gombru commented Oct 20, 2022

cucdengjunli commented Jun 21, 2023

How to decide the training epochs or early stop condition? #31

How to decide the training epochs or early stop condition? #31

Comments

idealwhite commented Mar 11, 2021

rromb commented Mar 29, 2021

sunshineatnoon commented Jun 7, 2021 • edited Loading

gombru commented Oct 20, 2022

cucdengjunli commented Jun 21, 2023

sunshineatnoon commented Jun 7, 2021 •

edited

Loading