
Re-init weights to avoid recompile #609

Closed
PilCAki opened this issue Aug 27, 2015 · 20 comments

Comments

@PilCAki

PilCAki commented Aug 27, 2015

If you'd like to average NNs with different inits, there is currently no easy way to get a different random initialization of the weights without recompiling.

Currently I can train nets on medium-sized data in about a minute, but compiling can take several minutes when the structure is particularly complex.

It's possible to save the weights and re-init with the original weights. However, there should be a method to reset a model to a "like-new" state, non-identical to the first init, without having to recompile.
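For reference, the save-and-restore workaround mentioned above looks roughly like the sketch below; the catch is that it restores the *same* initial weights every time rather than drawing a fresh random init (the model here is just illustrative):

```python
from keras.models import Sequential
from keras.layers import Dense

# Illustrative model; any compiled model works the same way.
model = Sequential([Dense(64, activation='relu', input_dim=100),
                    Dense(1)])
model.compile(optimizer='sgd', loss='mse')

# Snapshot the weights right after compiling, before any training.
initial_weights = model.get_weights()

# ... train the model ...

# "Reset" without recompiling -- but it restores the *same* init every time.
model.set_weights(initial_weights)
```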

@fchollet
Collaborator

I've thought about it, and I think I agree.

Would you be interested in submitting a PR for a layer.reset() method, from which we could build a model.reset() method? (Note that the layer method should also work for containers, recursively, since containers must have the same API as layers.)
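To make the proposal concrete, the contract might look like the sketch below. Nothing here exists in Keras yet; the `params` and `initializers` attributes are assumptions about layer internals, used purely for illustration:

```python
class Layer(object):
    def reset(self):
        # Hypothetical: re-draw each weight from the initializer
        # recorded when the layer was built.
        for weight, init in zip(self.params, self.initializers):
            weight.set_value(init(weight.get_value().shape))

class Container(Layer):
    def reset(self):
        # Containers expose the same API as layers, so reset
        # simply recurses into the child layers.
        for layer in self.layers:
            layer.reset()
```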

@PilCAki
Author

PilCAki commented Aug 28, 2015

Yes, I would be interested in that. I'll see what I can do, thanks :)

@pkch

pkch commented Dec 2, 2015

In the meantime, it might be worth clarifying in the documentation how to get a fresh set of randomly initialized weights. As far as I can tell, it doesn't happen just from recompiling?

@fgolemo

fgolemo commented Mar 7, 2016

After some experimenting:

  • you can go through each layer and call its build() function, which resets the weights but doesn't affect the compiled model
  • you can recompile, but that doesn't reset the weights
  • you can do both, in that order, and it will work (i.e. randomize the weights and biases); see the sketch below

I tried implementing that here (only for the Sequential model so far): #1908
Please give me feedback on whether it works for you. I tried it with a few different models and it seemed to reset them successfully.
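A minimal sketch of the two steps above combined, against the Keras-1.x-era API (the exact build() signature varied between versions, so treat this as illustrative):

```python
def reset_weights_and_recompile(model, optimizer='sgd', loss='mse'):
    # 1) Rebuild each layer: this re-draws its weights from the layer's
    #    initializer, but the compiled functions still reference the
    #    old parameters.
    for layer in model.layers:
        layer.build()
    # 2) Recompile so training/prediction pick up the fresh weights.
    model.compile(optimizer=optimizer, loss=loss)
    return model
```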

@seven7e

seven7e commented Sep 5, 2016

@fgolemo Hi, I also have the need to reset weights at the start of each round of cross-validation. Is the reset() function available in a release now?

@gokceneraslan
Contributor

No.

@stale

stale bot commented Jun 13, 2017

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 30 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@daferna

daferna commented Jun 15, 2017

I would like this feature because something suspicious seems to be happening with the memory allocator (at least with the Theano backend). I am using the functional API to do cross-validation, and even if I (1) delete the Python variables for my network, (2) recreate the network and recompile it, and (3) print the auto-generated Theano tensor names with model.summary() (verifying that they are actually different) and re-fit the network on different data, my per-fold loss is (nearly) monotonically decreasing.

For example, my fold losses come out to something like [1.17, 0.22, 0.08, 0.004, 0.04], which suggests that later folds are somehow getting access to the previous fold's weights.

@vmalyi

vmalyi commented Jul 11, 2017

I also feel the need for a layer.reset() method, for the case where I train multiple models with the same configuration but different data.

Thanks!

@stale

stale bot commented Oct 9, 2017

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 30 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@HeikoSchuett

Hi! Has there been any progress on this? I would also love to have this feature for cross-validation of models!

@AndersAsa

Also interested

@javedqadruddin

This seems to work: https://www.codementor.io/nitinsurya/how-to-re-initialize-keras-model-weights-et41zre2g
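For context, the linked snippet works by re-running each layer's initializer op in the current TensorFlow session, roughly like this (TF1-era Keras; a sketch of the linked approach, not an official API):

```python
from keras import backend as K

def reinitialize_weights(model):
    session = K.get_session()
    for layer in model.layers:
        # Re-run the initializer ops for layers that have them.
        if hasattr(layer, 'kernel_initializer'):
            layer.kernel.initializer.run(session=session)
        if hasattr(layer, 'bias_initializer') and getattr(layer, 'use_bias', False):
            layer.bias.initializer.run(session=session)
```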

@bschreck

@javedqadruddin That doesn't work for recurrent layers.
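One way to extend that snippet to recurrent layers is to look at the inner cell object, which is where Keras RNN layers keep their kernel, recurrent kernel, and bias (again a sketch in the same TF1 session style, not a supported API):

```python
def reinitialize_recurrent_layer(layer, session):
    # RNN layers (LSTM, GRU, ...) store their variables on layer.cell;
    # for non-recurrent layers, fall back to the layer itself.
    target = getattr(layer, 'cell', layer)
    for name in ('kernel', 'recurrent_kernel', 'bias'):
        var = getattr(target, name, None)
        if var is not None:
            var.initializer.run(session=session)
```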

@MinnML

MinnML commented Aug 28, 2018

I have the same issue. My workaround for now is to put all the model-creation and compile lines in a function, call del model at the end of each iteration, and recreate the model with that function. With this, my loss starts from roughly the same point each time instead of monotonically decreasing.
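A self-contained sketch of that workaround in a cross-validation loop (the data, model, and hyperparameters here are placeholders):

```python
import numpy as np
from sklearn.model_selection import KFold
from keras.models import Sequential
from keras.layers import Dense

def build_model():
    # Keep all model creation and compilation in one function, so
    # every call returns a freshly initialized, freshly compiled model.
    model = Sequential([Dense(64, activation='relu', input_dim=100),
                        Dense(1)])
    model.compile(optimizer='sgd', loss='mse')
    return model

X, y = np.random.rand(500, 100), np.random.rand(500)  # placeholder data

for train_idx, val_idx in KFold(n_splits=5).split(X):
    model = build_model()
    model.fit(X[train_idx], y[train_idx],
              validation_data=(X[val_idx], y[val_idx]), epochs=3)
    del model  # drop the reference before the next fold
```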

@ceylanb

ceylanb commented Jun 17, 2019

> I have the same issue. My workaround for now is to put all the model-creation and compile lines in a function, call del model at the end of each iteration, and recreate the model with that function.

A memory leak occurred when I created the model and deleted it with del model, so that approach does not work for me. I just need to set the layer weights randomly, and in doing so the layer's initializer should be used rather than numpy.

> This seems to work: https://www.codementor.io/nitinsurya/how-to-re-initialize-keras-model-weights-et41zre2g

I have tried this solution, but an error occurred because of an empty layer initializer: "*** AttributeError: 'NoneType' object has no attribute 'run'"
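A small guard works around that AttributeError: iterate over model.weights and skip any variable that doesn't carry an initializer op (same TF1 session style as the snippets above; a sketch, not an official API):

```python
from keras import backend as K

def safe_reinitialize(model):
    session = K.get_session()
    for weight in model.weights:
        initializer = getattr(weight, 'initializer', None)
        # Variables from some custom or shared layers may have no
        # initializer op attached; skip them instead of crashing.
        if initializer is not None:
            initializer.run(session=session)
```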

@raullalves

Any progress on re-initializing a model with recurrent layers?

@vedal

vedal commented Apr 7, 2020

Is this currently on the roadmap?

@Korrakas

Kindly but strongly reviving this thread.
K-fold cross-validation isn't an esoteric thing to do with your data, yet the lack of a simple, clear way to reinitialize weights makes it unnecessarily complicated. Resetting to the saved initial weights, while it kind of works, (1) is hacky and (2) introduces a bias that conflicts with one of k-fold's objectives: evaluating an architecture.

@GF-Huang

Any solutions?
