Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Metadata #6

Open
mankoff opened this issue Oct 21, 2021 · 3 comments
Open

Metadata #6

mankoff opened this issue Oct 21, 2021 · 3 comments

Comments

@mankoff
Copy link
Member

mankoff commented Oct 21, 2021

Datalad supports metadata: http://docs.datalad.org/en/stable/metadata.html

Which format should we use? Pro's and cons here...

For @jmlea16 and all.

@AdrienWehrle
Copy link
Member

Also, how should we handle the metadata collection? A combination of manual entries and more automated extractions through e.g. metadata extractors? Automation is nice, but we need to stick to the chosen convention.

@mankoff
Copy link
Member Author

mankoff commented Oct 21, 2021

I'd love to automate this, but we as a community are not providing the right metadata to do so. I think we'll need to do it manually. Given that we'll start with 5 or 25 datasets, that's easy to do. Even if we grow to a few 100, manual is solvable at the dataset level. It would be nice if 3rd parties provided sufficient file-level metadata that we could fetch, for example, specific velocity maps based on date or time, or Landsat scenes based on cloud cover %. But for now, just data-set level stuff.

What do we want to track?

  • Source data: DOI, reference, URL
  • Science reference & DOI
  • Organization & project
  • Geospatial ROI.

Maybe keywords from a controlled vocabulary, for example, "velocity" "biology" "ice" "ocean" "atmosphere" "temperature" etc.?

@AdrienWehrle
Copy link
Member

Why not JSON? Or any other human-readable object notation. We could have nested objects like for the ROI: the name would be the standard field to add, then one could nest another object into it for the polygon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants