Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add BoostFromAverage to Python model. #4234

Closed
JoshuaC3 opened this issue Apr 27, 2021 · 1 comment
Closed

Add BoostFromAverage to Python model. #4234

JoshuaC3 opened this issue Apr 27, 2021 · 1 comment

Comments

@JoshuaC3
Copy link

JoshuaC3 commented Apr 27, 2021

Summary

Add the BoostFromAverage output to the init_score on the train_set Dataset. It seems to me that they serve the same function - that is, reducing training time by allowing the model to have some previous knowledge of the data before the first boosting round. BoostingFromAverage is a simply a subset of the init_score functionality.

Motivation

This allows us to recover the mean of the target variable after training and then when saving the model. There are many instances where we may not have access to the label data (and therefore the mean) after saving a model.

This is also the equivalent to the intercept in EBMs so aids in comparisons of LightGBM and EBM. See #3905 (comment) and EBM intercept calculations.

Finally, and maybe most importantly, this would allow a user better insights into how the model is working at each stage and would help with model interpretability #4065 and #3095.

Description

Something like, writing the results from BoostFromAverage back to the model here

double GBDT::BoostFromAverage(int class_id, bool update_scorer) {

References

@JoshuaC3 JoshuaC3 changed the title Add BoostFromAverage output to init_score Add BoostFromAverage to Python model. Apr 27, 2021
@StrikerRUS
Copy link
Collaborator

Superseded by #4313 as a more general approach.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants