
Not able to train with dart and early_stopping_rounds #1893

Closed
RickoClausen opened this issue Dec 6, 2018 · 16 comments

RickoClausen commented Dec 6, 2018

Environment info

Operating System: MacOS Mojave and Ubuntu

CPU/GPU model: CPU

C++/Python/R version: Python

Error message

When using the dart boosting type, the trained model does not match the stopping point when early_stopping_rounds is applied: the RMSE after training is not the same as it was at the best iteration reported during training.

When I use gbdt the model trains fine, and I am able to reproduce the RMSE reported during training.

Reproducible examples

import lightgbm
import numpy as np

np.random.seed(1234)

params = {
    "early_stopping_rounds": 100,
    "metric": "root_mean_squared_error",
    "objective": "regression",
    "num_boost_round": 1000,
    "boosting_type": "dart",
}

# Synthetic regression data: the target depends linearly on the first feature.
size = (245688, 470)
x = np.random.exponential(scale=10, size=size)
y = 2 * x[:, 0] + np.random.exponential(scale=2, size=(size[0],))

x_val = np.random.exponential(scale=10, size=(int(size[0] / 13), size[1]))
y_val = 2 * x_val[:, 0] + np.random.exponential(scale=2, size=(int(size[0] / 13),))

model = lightgbm.LGBMModel(**params)

# Train with early stopping, then re-score the training set with the final model.
model.fit(x, y, eval_set=[(x, y), (x_val, y_val)], verbose=50)
train_pred = model.predict(x)
rmse = np.sqrt(np.mean((y - train_pred) ** 2))
print(f"Train rmse: {rmse}")

Output:

UserWarning: Starting from version 2.2.1, the library file in distribution wheels for macOS is built by the Apple Clang (Xcode_8.3.1) compiler.
This means that in case of installing LightGBM from PyPI via the ``pip install lightgbm`` command, you don't need to install the gcc compiler anymore.
Instead of that, you need to install the OpenMP library, which is required for running LightGBM on the system with the Apple Clang compiler.
You can install the OpenMP library by the following command: ``brew install libomp``.
  "You can install the OpenMP library by the following command: ``brew install libomp``.", UserWarning)
UserWarning: Found `num_boost_round` in params. Will use it instead of argument
  warnings.warn("Found `{}` in params. Will use it instead of argument".format(alias))
UserWarning: Found `early_stopping_rounds` in params. Will use it instead of argument
  warnings.warn("Found `{}` in params. Will use it instead of argument".format(alias))
Training until validation scores don't improve for 100 rounds.
[50]    valid_0's rmse: 3.86284 valid_1's rmse: 4.04269
[100]   valid_0's rmse: 3.64912 valid_1's rmse: 3.88063
Early stopping, best iteration is:
[34]    valid_0's rmse: 2.60659 valid_1's rmse: 2.88739
Train rmse: 16.60181744687661

Thanks for an amazing product! 👍

guolinke (Collaborator) commented Dec 6, 2018

I think early stopping and dart cannot be used together. The reason is that when using dart, the previous trees are updated. For example, in your case, although iteration 34 is the best, those trees are changed in later iterations, since dart keeps updating the previous trees.

To support this, a simple solution is to clone the model at best_iteration at the moment it is reached, to avoid the later updates on it. You can write a callback function to achieve this.

@StrikerRUS
Is it easy for us to support this on the python-package side? At least a warning is needed.
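
A minimal sketch of what such a snapshot callback might look like, assuming the Python package's callback interface (env.model, env.evaluation_result_list, and Booster.model_to_string()); the name snapshot_best and the monitor_index parameter are hypothetical:

def snapshot_best(monitor_index=0):
    # Hypothetical helper, not part of LightGBM: keep a serialized copy of
    # the model whenever the monitored metric improves, so that later dart
    # iterations cannot overwrite the best-iteration trees.
    best = {"score": None, "model_str": None}

    def _callback(env):
        # Entries in env.evaluation_result_list are tuples of
        # (dataset_name, metric_name, value, is_higher_better).
        _, _, value, is_higher_better = env.evaluation_result_list[monitor_index]
        improved = best["score"] is None or (
            value > best["score"] if is_higher_better else value < best["score"]
        )
        if improved:
            best["score"] = value
            best["model_str"] = env.model.model_to_string()  # snapshot the current trees

    _callback.best = best
    return _callback

Passing cb = snapshot_best() via callbacks=[cb] to lightgbm.train() and rebuilding the booster afterwards with lightgbm.Booster(model_str=cb.best["model_str"]) should recover the model as it stood at the best iteration.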

bbennett36 commented

The issue here is that he's trying to use the sklearn version of LightGBM, which (from my understanding) doesn't support early stopping.

I have used early stopping and dart with no issues for the past couple of months on multiple models. However, I do have to set the early stopping rounds higher than normal, because there are cases where the validation score will rise, then drop, then start rising again. I have to use a higher learning rate as well so it doesn't take forever to run.
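
For illustration, a hedged sketch of that kind of configuration; the values here are arbitrary examples of the idea, not tuned recommendations:

params = {
    "boosting_type": "dart",
    "objective": "regression",
    "metric": "root_mean_squared_error",
    # dart validation scores can rise, drop, then rise again,
    # so allow more patience than usual before stopping ...
    "early_stopping_rounds": 300,
    # ... and use a larger learning rate so training still finishes in reasonable time
    "learning_rate": 0.1,
}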

RickoClausen (Author) commented

I get the same "problem" when using the non-sklearn syntax.

bbennett36 commented

@RickoClausen do you have "boost_from_average" = False?

StrikerRUS (Collaborator) commented

@guolinke Should the same (#1895) be done for the R-package, and then this issue can be closed?

guolinke (Collaborator) commented Feb 3, 2019

Yeah, this fix should be in the R-package too.

StrikerRUS (Collaborator) commented

ping @Laurae2 and @jameslamb for the R fix

Laurae2 (Contributor) commented Mar 20, 2019

For R we can add a simple check here: https://github.com/Microsoft/LightGBM/blob/master/R-package/R/lgb.train.R#L205-L209

Will try to do it by the end of this week.
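
For reference, a minimal Python sketch of the kind of guard the python-package gained in #1895 (the helper name and its exact placement are assumptions; the R check would be analogous):

import warnings

def _warn_if_dart_with_early_stopping(params, uses_early_stopping):
    # dart mutates earlier trees, so the model at best_iteration is not
    # preserved once training continues past it.
    boosting = params.get("boosting_type", params.get("boosting", "gbdt"))
    if boosting == "dart" and uses_early_stopping:
        warnings.warn("Early stopping is not available in dart mode.")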

StrikerRUS (Collaborator) commented

@Laurae2 It seems that this check will not solve the issue when users create the early stopping callback themselves.

StrikerRUS (Collaborator) commented

@Laurae2

guolinke (Collaborator) commented Aug 1, 2019

ping @Laurae2 @jameslamb for the R fix

jameslamb (Collaborator) commented

Thank you for the ping, will pick it up soon.

StrikerRUS (Collaborator) commented

Any updates?

jameslamb (Collaborator) commented

> Any updates?

Sorry for the delay. Attempted a fix in #2443. There are parts of this section of the code that I'm not very familiar with, so let's see if @Laurae2 agrees with my proposal in the PR review.

StrikerRUS (Collaborator) commented

The R-package should be fixed in #2443.

jameslamb (Collaborator) commented

Thanks @StrikerRUS. Sorry, I should have come back and closed this.

lock bot locked as resolved and limited conversation to collaborators on Mar 10, 2020