Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Persist meta data of experiments #1128

Closed
sperlingxx opened this issue Apr 7, 2020 · 7 comments
Closed

Persist meta data of experiments #1128

sperlingxx opened this issue Apr 7, 2020 · 7 comments

Comments

@sperlingxx
Copy link
Member

sperlingxx commented Apr 7, 2020

/kind feature

Describe the solution you'd like
Meta data generated during HPO process is worthy to restore. It can be used to guide subsequent tuning works. Maybe it can also be precious data of further academic research.

To me, since katib has DB abstraction layer (KatibDBInterface) , we can extend it to persist meta data such as experiment settings and corresponding trial information (param assignments and metrics).

Anything else you would like to add:
In current katib UI system, visualization will be unavailable once experiment instance deleted. If we can persist all information in DB, it is possible for users to review all history experiments. I think it will be helpful.
/cc @andreyvelich @gaocegege @richardsliu @hougangliu @johnugeorge

@issue-label-bot
Copy link

Issue-Label Bot is automatically applying the labels:

Label Probability
feature 0.93

Please mark this comment with 👍 or 👎 to give our bot feedback!
Links: app homepage, dashboard and code for this bot.

@jlewi jlewi removed the feature label Apr 7, 2020
@andreyvelich
Copy link
Member

@sperlingxx Thank you for the issue. SGTM.
Previously, we stored Experiments search space in the Katib DB as well as metrics.
Currently, ones user deletes Experiment, all Trials metrics are cleaned in the DB.

Maybe we can control it with resuming Experiment flag.

@sperlingxx
Copy link
Member Author

@andreyvelich I fully agree with the approach "control it with resuming Experiment flag". Shall we add this issue into 2020 roadmap?

@andreyvelich
Copy link
Member

@gaocegege @johnugeorge Any thoughts?

@johnugeorge
Copy link
Member

related:#1150 (comment)

@andreyvelich
Copy link
Member

Resuming Experiment was implemented, feed free to re-open the issue if you have any other questions.

@pod3275
Copy link

pod3275 commented Sep 16, 2022

I have one question about this issue.

We can now control whether the Suggestion deploy is deleted after the experiment is finished by setting ResumePolicy=never or ResumePolicy=FromVolume.

However, Trial, Suggestion, and Experiment CRs still remain.
If we delete Experiment CR, the results of the experiment will not be displayed on the Katib UI.
Managing the number of CRs could be challenging when a large number of experiments are performed.

As discussed at the beginning of this issue, how about adding an option to persist all meta data of the experiment in DB and delete Katib CRs?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants