-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SCHEMATIC-1] BugFix: manifest submission when data previously annotated without manifest upload #1538
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great work @GiaJordan , I'll defer to @BryanFauble and/or @andrewelamb for final review
This reverts commit 68b0b24.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Quality Gate passedIssues Measures |
Issue
Resolves SCHEMATIC-1 which occurs when users:
Changes
DatasetFileView
and the resulting view is "tidied", any existingeTag
columns will be removed before the new values are stored in that column.DatasetFileView
directly instead of through manifest generationNotes
--use_annotations
to generate a manifest for a dataset where there isn't already a manifest present is not supported, but this provides a way for users to get back on the recommended data flow path instead of having to start all over.DatasetFileView
is only used in this one case, gathering annotations for files in a dataset with more than 50 files. With the ability we have now to scope fileview queries in theSynapseStorage
object and get the annotations for all files in a dataset we should consider whether we should deprecate theDatasetFileView
class.--use_annotations
feature when there is no manifest in the dataset since this change enables it in at least this case.