-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generate metadata about a study #93
Comments
I've been thinking about the extent of metadata Maestro would want to capture and I came up with some of the following:
I had some other thoughts, but I can't seem to find them right now. I'll update as things come back to me. |
I think that would be useful. It might also be helpful to capture diagnostic information as well. Does Maestro keep the history of the DAG execution? The status file tracks the most recent state of the workflow, but there isn't really a way to see the full workflow history. |
@jsemler -- The only history tracked is when the DAG changes the states of its records. The status is what gives the full history, since it has when a step was submitted and such. Currently there is no full restart history with timestamps. The history tracking was purposely left somewhat lightweight since this is all going into a pickle so I planned to revisit this when I backed things with a proper database. This ticket is going to have some implications on #95 -- the metadata related to the hashing of inputs and other information is going to dictate either how we restart or if we're able to restart at all. I'm going to be putting in some thoughts in the restart ticket. |
@jsemler -- Another thought, some objects would need to be responsible for hashing themselves. Some of the ones that come to mind right off the bat:
Other things to consider, we might need to pickle the |
The start of this issue are in PR #120 -- the |
@FrankD412 -- I agree. I think the PR #120 resolves this ticket. |
It would be useful for MaestroWF to write metadata about a launched study. This would make it easier for post processing scripts to find output data and other information.
Metadata could include:
The text was updated successfully, but these errors were encountered: