-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multi-Armed Bandit #343
Multi-Armed Bandit #343
Conversation
@AdrianSosic @julianStreibel is this already ready for review or do the two of you want to do a first round of review first? |
Hi @AVHopp, I'll do a first iteration 👍🏼 |
5ffd6df
to
3fe605f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @julianStreibel, thanks so much for this very nice PR. I'm already in the middle of the review/code roast and will continue on Monday. But here already a few things where I need input / that we need to discuss. See ya next week 😎
6cd8fd2
to
ea69f52
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here my open points
the target branch for this PR is not main but |
No, that was done by @julianStreibel, probably because the two of haven't talked about where this is supposed to go. Have change it now. Once the dev branch is merged, I'll rebase |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here same comment as thread so that we can discuss.
ea69f52
to
7c6fb72
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hey guys @julianStreibel @AdrianSosic this is fantastic and I love the example - thanks a lot. I might need another round since this is a lot of code, but I'll leave a first round of comments and stupid questions
…tions and recommendations already measured
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for this epic piece of work :) My main issue is the example which I think is too complex, see the individual comments.
8e1837a
to
b98b074
Compare
88e43a3
to
3f5c086
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approve to unblock
ba27028
to
b23d400
Compare
Switch from positive/negative to success/failure
Switch from True/False to 1.0/0.0
* The normalization factor of the Frobenius norm was wrong. Should have been divided by number of Monte Carlos instead of number of arms. * Computation via bias-variance decomposition is more suggestive.
* Avoids to set the allow_* flags in the RandomRecommender since each run gets a fresh search space instance with untouched metadata * Added comment on remaining necessary flags
Otherwise causes a lot of problems because nan!=nan
b23d400
to
8eae563
Compare
This PR enables bandit models by bringing the following new features: