
Fix ssr #559

Open · wants to merge 5 commits into master
Conversation

nehalsinghmangat
Collaborator

Fixes #532. The original SSR implementation used only an l0 penalty, which defaulted to 0, meaning the full non-sparse model was always chosen. The SSR paper, however, performs model selection based on the "inflection point" of the error history. We implemented the function _ind_inflection, which takes the error history of the SSR iterations as input and returns the index at which the inflection occurs, using the inflection metric described in the SSR paper. This model selection based on the error-history inflection is the new default when the l0 penalty is 0. The API does not change, but the different results can be seen in the unit test test_ssr_history_selection.
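The paper's exact metric is not reproduced here, but the shape of the helper can be illustrated with a minimal sketch. Everything below is an assumption for illustration only: the signature, the ordering of the error history (densest model first), and the largest-jump heuristic standing in for the paper's inflection metric.

```python
import numpy as np

def _ind_inflection(err_history):
    """Sketch of an inflection-index helper for an SSR error history.

    Assumes err_history is ordered from the densest model to the sparsest
    and picks the model just before the largest single increase in error
    caused by pruning one more term.  The real implementation follows the
    metric in the SSR paper and may differ from this heuristic.
    """
    err = np.asarray(err_history, dtype=float)
    if err.size < 2:
        return 0
    # Error increase incurred by each successive pruning step
    jumps = np.diff(err)
    # Keep the model immediately before the largest increase
    return int(np.argmax(jumps))
```

With this sketch, _ind_inflection([0.10, 0.11, 0.12, 0.60, 0.95]) returns 2, i.e. the last model before the error takes off.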

Index alignment between the coefficient and error histories was thrown off because BaseOptimizer initializes history_ with an unregularized regression, which serves as the initial guess in iterative nonconvex regressions. SSR, however, does not require an initial guess.

This commit removes the initial guess from the SSR history_ and adds a test. Previously, SSR had simply been finding the 0-sparsity model: instead of the algorithm in the paper, it was executing a Bayes-Information-Criterion-like model selection, but with a very low l0 penalty.

This test asserts the behavior described in #532.
Closes #532

This commit implements the inflection-point criterion from the SSR paper, although it still measures training loss rather than cross-validation loss. It also changes the test to use a better-conditioned feature library, since the l0 penalty is based on the condition number.

Also fixes the use of inflection-point model selection: because the error history is reversed before the inflection index is calculated, the returned index counts from the end of the list (see the sketch after these notes).
Also extracts the calculation of the inflection index in SSR (+ tests), and hacks SSR through tests that provide single-feature data, in the same way that trapping was handled.
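To illustrate the indexing caveat above, here is a minimal, runnable sketch; the variable names and the hard-coded inflection index are assumptions for illustration, not the PR's actual code:

```python
# Error per SSR iteration, in the order it was recorded (values made up).
err_history = [0.18, 0.19, 0.21, 0.55, 0.90]
# Stand-in coefficient history: one (shrinking) coefficient list per iteration.
history_ = [[1.0] * (len(err_history) - k) for k in range(len(err_history))]

# Suppose the inflection helper, run on err_history[::-1], returned index 1.
# Because the history was reversed first, that index counts from the end.
ind_from_end = 1

# Map back to the forward ordering before indexing into history_.
chosen = len(err_history) - 1 - ind_from_end
coef_ = history_[chosen]
print(chosen, coef_)  # 3 [1.0, 1.0]
```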

codecov bot commented Sep 18, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.31%. Comparing base (23da590) to head (18bac4f).

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #559      +/-   ##
==========================================
+ Coverage   95.30%   95.31%   +0.01%     
==========================================
  Files          37       37              
  Lines        4046     4059      +13     
==========================================
+ Hits         3856     3869      +13     
  Misses        190      190              

☔ View full report in Codecov by Sentry.

@Jacob-Stevens-Haas
Collaborator

@himkwtn @yb6599 @MPeng5, if you're interested in doing a review, this is a fix to the SSR optimizer that Nehal and I were working on.

No need to review, it's just an opportunity if you're interested. I'll merge it in a week if nobody picks up the review.

Development

Successfully merging this pull request may close these issues.

[BUG] SSR optimizer always picks the first model it generates