-
Notifications
You must be signed in to change notification settings - Fork 441
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Suggestion stuck for HyperBand when len(response.ParameterAssignments) < requestNum
#1168
Comments
Issue-Label Bot is automatically applying the labels:
Please mark this comment with 👍 or 👎 to give our bot feedback! |
Thank you for your issue @czheng94. |
Perhaps this should be a separate issue, but I'm curious if it's possible to decouple Part of my issue is setting |
Issue-Label Bot is automatically applying the labels:
Please mark this comment with 👍 or 👎 to give our bot feedback! |
Ref issue: #1389. |
/kind bug
Suggestion should support cases where
len(response.ParameterAssignments) < requestNum
What steps did you take and what happened:
After the first 9 trials succeeded,
spec.requests
in theSuggestion
will be increased by 9. However, it will be stuck at this stateThis is because the number of suggestion that can be generated from the HyperBand algorithm is only 3 at this time (there are only 3 candidates in the current bracket, and you can't jump to the next bracket before getting results from these 3 candidates). It's less than the requested number to the suggestion service. And according to the implementation below, it's going to return an error.
katib/pkg/controller.v1alpha3/suggestion/suggestionclient/suggestionclient.go
Lines 84 to 88 in 9155fa7
You will see the following logs from suggestion controller:
I believe we should always allow
len(response.ParameterAssignments) <= requestNum
, especially for HyperBand and Grid search, where there will be a fixed number of parameter assignments following a certain heuristics.What did you expect to happen:
The suggestion controller should update 3 parameter assignments returned by the suggestion service into its status, instead of throwing an error.
Anything else you would like to add:
As addressed above
Environment:
kubectl version
):/etc/os-release
):The text was updated successfully, but these errors were encountered: