# ResNet and DenseNet with Bipolar Rectified Linear Units
This repo is a copy of https://github.com/facebook/fb.resnet.torch. The very deep 1K-layer model (resnet-pre-act.lua) is copied from https://github.com/KaimingHe/resnet-1k-layers, and the DenseNet model (densenet.lua) is copied from https://github.com/liuzhuang13/DenseNet. The code is then modified in the following ways:
- Added a command line option for replacing ReLU with BReLU (`BReLU`)
- Added command line options for DenseNet (`growthRate`, `dropRate` and `bottleneck`)
) - Train Cifar-10 and Cifar-100 networks for 300 epochs
- Train Cifar-10 and Cifar-100 networks with learning rate decay at epochs 150 and 225 (as prescribed for DenseNet; see the schedule sketch after this list)
- Added logging of training and test error to `log.txt` every epoch
- Added logging of command line options to `opt.txt` at startup
- Added logging of the model structure as text to `model.txt` at startup
- Commented out the per-batch debug printout during training and validation
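For reference, here is a minimal sketch of what the 150/225 decay looks like when patched into the `Trainer:learningRate` function of the base repo's train.lua; the exact code in this repo may differ slightly:

```lua
-- Sketch of the Cifar learning-rate schedule with 10x decay at epochs
-- 150 and 225 (of 300), in the style of fb.resnet.torch's train.lua.
function Trainer:learningRate(epoch)
   local decay = 0
   if self.opt.dataset == 'imagenet' then
      decay = math.floor((epoch - 1) / 30)
   elseif self.opt.dataset == 'cifar10' or self.opt.dataset == 'cifar100' then
      -- two step decays: LR, LR/10 from epoch 150, LR/100 from epoch 225
      decay = epoch >= 225 and 2 or epoch >= 150 and 1 or 0
   end
   return self.opt.LR * math.pow(0.1, decay)
end
```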
The current implementation of BReLU does not support training on multiple GPUs.
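For illustration, a minimal sketch of a bipolar ReLU module, assuming the definition from the bipolar activation functions paper: standard ReLU on half of the channels and its mirror image -ReLU(-x) on the other half, so that the mean activation is pushed towards zero. The module name `nn.BReLU` and the even/odd channel split are illustrative assumptions, not necessarily the repo's actual code:

```lua
require 'nn'

-- Hypothetical BReLU sketch: f_i(x) = ReLU(x_i) on odd channels and
-- -ReLU(-x_i) on even channels (channel dimension assumed to be dim 2).
local BReLU, parent = torch.class('nn.BReLU', 'nn.Module')

function BReLU:updateOutput(input)
   -- build a +1/-1 sign mask that alternates over the channel dimension
   if not self.mask or not self.mask:isSameSizeAs(input) then
      self.mask = input.new():resizeAs(input)
      for c = 1, input:size(2) do
         self.mask:select(2, c):fill(c % 2 == 0 and -1 or 1)
      end
   end
   -- f(x) = mask * ReLU(mask * x)
   self.output:resizeAs(input):copy(input)
   self.output:cmul(self.mask):cmax(0):cmul(self.mask)
   return self.output
end

function BReLU:updateGradInput(input, gradOutput)
   -- the derivative is 1 where mask * x > 0 and 0 elsewhere
   local active = torch.cmul(input, self.mask):gt(0):typeAs(input)
   self.gradInput:resizeAs(gradOutput):copy(gradOutput):cmul(active)
   return self.gradInput
end
```

With a module like this, the model builders can swap every `nn.ReLU` for `nn.BReLU` when the `-BReLU true` flag is given.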
Test error (%) with BReLU compared to ReLU on Cifar-10 with data augmentation:
Network | ReLU (own) | BReLU (run1) | BReLU (run2) | BReLU (run3) | BReLU (run4) |
---|---|---|---|---|---|
ResNet-44 | 7.17 (6.44) | 6.49 | 6.80 | 6.64 | 6.61 |
ResNet-110 | 6.61/6.41 (6.92) | 5.76 | 6.10 | 5.86 | 6.07 |
Test error (%) with BReLU compared to ReLU on Cifar-100 with data augmentation:
Network | ReLU (own) | BReLU (run1) | BReLU (run2) | BReLU (run3) | BReLU (run4) |
---|---|---|---|---|---|
ResNet-44 | 27.76 (29.4) | 29.30 | 29.96 | 29.85 | 29.67 |
ResNet-110 | 27.22 | 27.74 | 28.04 | 28.41 | 28.11 |
To train a pre-activation ResNet-1001 with BReLU on 1 GPU:

```
th main.lua -netType resnet-pre-act -depth 1001 -batchSize 64 -nGPU 1 -dataset cifar10 -nEpochs 200 -BReLU true
```

To train a ResNet-44 with BReLU on 1 GPU:

```
th main.lua -netType resnet -depth 44 -batchSize 128 -nGPU 1 -dataset cifar10 -BReLU true
```

To train a ResNet-110 with BReLU on 1 GPU:

```
th main.lua -netType resnet -depth 110 -batchSize 128 -nGPU 1 -dataset cifar10 -BReLU true
```

To train a DenseNet-100 with BReLU on 1 GPU:

```
th main.lua -netType densenet -depth 100 -batchSize 128 -growthRate 12 -nGPU 1 -dataset cifar10 -optnet true -BReLU true
```

To train a DenseNet-190 with BReLU on 1 GPU:

```
th main.lua -netType densenet -depth 190 -batchSize 64 -growthRate 40 -nGPU 1 -dataset cifar10 -optnet true -BReLU true
```