
Feature Cycling as an option instead of Random Feature Sampling #4066

Closed
JoshuaC3 opened this issue Mar 12, 2021 · 1 comment

Comments

@JoshuaC3

Summary

Add an option so that the model selects features cyclically, instead of sampling them randomly.

See here for the initial discussion on LightGBM and EBMs.

Motivation

Model explainability is becoming ever more important in the ML space. LightGBM can take advantage of some of the methods used by Explainable Boosting Machines (EBMs) to make models more interpretable. One of the features of EBMs is building shallow, single-feature trees (currently possible in LightGBM by toying with parameters). However, these trees are boosted in a cyclic fashion.

For example, for a model with 3 features:

Tree 1 - feature 1
Tree 2 - feature 2
Tree 3 - feature 3
Tree 4 - feature 1
Tree 5 - feature 2
Tree 6 - feature 3
...

This ensures the model gains information from collinear features that may be equally important. By comparison, with a small number of deeper trees it is easy for the gain to become biased toward whichever features happen to be randomly selected first.
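As a rough illustration of the cycling schedule above, here is a minimal sketch using scikit-learn depth-1 trees; this is an assumption-laden toy, not LightGBM's or InterpretML's actual implementation:

```python
# Hypothetical sketch of cyclic, single-feature boosting (EBM-style).
# NOT LightGBM's implementation; sklearn stumps illustrate the
# round-robin feature schedule described above.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = 2.0 * X[:, 0] + X[:, 1] - 0.5 * X[:, 2] + rng.normal(scale=0.1, size=200)

n_rounds, lr = 30, 0.3
pred = np.zeros(len(y))
trees = []
for t in range(n_rounds):
    f = t % X.shape[1]                 # cycle: round t uses feature t mod n_features
    stump = DecisionTreeRegressor(max_depth=1)
    stump.fit(X[:, [f]], y - pred)     # fit the current residual on one feature only
    pred += lr * stump.predict(X[:, [f]])
    trees.append((f, stump))

mse = float(np.mean((y - pred) ** 2))  # training error after cyclic boosting
```

Because every feature gets the same number of boosting rounds, each feature's trees can be summed into a per-feature shape function, which is what makes the EBM-style model easy to inspect.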

Description

The feature would make LightGBM more interpretable and its results more comparable to EBMs. This would allow users to make informed decisions on the trade-off between interpretability and model performance.

Additionally, in certain cases, the model may be more robust at inference time if collinear features are missing.

References

A great conceptual video explanation.

InterpretML: A Unified Framework for Machine Learning Interpretability
InterpretML: A toolkit for understanding machine learning models
InterpretML's Explainable Boosting Machine

@StrikerRUS
Collaborator

Closed in favor of tracking this in #2302. We decided to keep all feature requests in one place.

Contributions of this feature are welcome! Please re-open this issue (or post a comment if you are not the topic starter) if you are actively working on implementing it.
