Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to get the First Model for CIFAR100 #6

Open
PatrickZH opened this issue Feb 11, 2019 · 3 comments
Open

How to get the First Model for CIFAR100 #6

PatrickZH opened this issue Feb 11, 2019 · 3 comments

Comments

@PatrickZH
Copy link

Thank you for sharing the code. I wonder the detail of obtaining the first (ResNet) model for the first 10 classes in CIFAR100. I tried to reimplement your code with Pytorch. However, even at the first step, the performance is only 85% (averaged on many random splits, with your augmentation strategy), but yours is nearly 90%. It is said that the first model is trained using ResNet-Matconvnet. Do you used any pretrained model (by finetuning) when train the first model?

@PatrickZH
Copy link
Author

By the way, in the Fig. 4, it is interesting that, at the first step, all models w./w.o. data augmentation, achieved nearly the same performance. Does it mean that data augmentation does not contribute to the training of the first model?

@fmcp
Copy link
Owner

fmcp commented Feb 20, 2019

Hi,

I use ResNet-Matconvnet for training the first model. Note that I use random noise and L2 norm in the gradients. Maybe that’s the difference.

Regarding the data augmentation, since I use the base model trained with ResNet-Matconvnet, it doesn’t use my data augmentation.

@PatrickZH
Copy link
Author

Thank you for the reply! I will keep trying it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants