-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Do we have too many attributes? #1623
Comments
This is still a problem. These are not used:
I will plan to delete them if nobody objects in the next few days. Can we clean up or document the rest of the list? I think about half of it's misplaced "reproductive data" and the other half is something about parasites... |
water temperature @ 183 HMMMM. These "environmental attributes" seem like they belong with the collecting event....This is probably going to become a larger issue when the UTEP ants start going in. There is A LOT of environmental data with them (soil moisture, soil type, soil temperature). I am overwhelmed with taxonomy, geology, and locality right now and not sure who else has time to take this on. If no one is worried about it right now, I say we put it on the back burner until some of the problems people are actively working on are resolved. @dustymc do you need it resolved for something pressing? |
Yes some clearly need moved elsewhere if we ever get a better home for them. I am absolutely fine with using Attributes as a staging area; most anything can be denormalized there. I'm fine with obscure attributes - if there's a real chance someone's going to ask hard questions of tail base width then we should absolutely keep it, even if it's only used 17 times per decade. (But it needs documented - maybe those 17 determinations also represent 17 techniques and these data are therefore completely useless.) I think the temperature attributes are just new - not a problem. I am not OK with having many ways of doing something - of presenting confounded and unusable data to the world - and from here that's what a bunch of this looks like. Not pressing, I was just answering an email regarding attribution and became overwhelmingly re-appalled at the attribute row of https://docs.google.com/spreadsheets/d/1ElIuKfljO48gaosR7Ml1irSzPdjxiOmhId81ybsrfMQ/edit#gid=432374024 |
Agree with all of the above! Any chance we can ping the person who created each attribute without a definition to get them to supply one? |
Not really - we started tracking who's creating authorities and requiring definitions at about the same time. I can do this though:
See also #1450 |
ref: https://github.com/ArctosDB/data-migration/issues/71
Consider a "random structured non-core data" attribute to hold things like
which can do ~anything two new attributes can do. |
removed some unused, closing for lack of interest |
From #1597
Ref: #1620
My reservation here is that we now have ~200 attributes, half-ish undocumented (#1450). Some (most!?!) of the "documentation" we do have is not useful for any purpose: "measured how?" or "Standard beak measurement for birds". These things have become numerous enough to start causing problems merely by existing. (Operators and researchers may not find what they're looking for, turning them all on causes serious performance problems, etc.)
This request is clearly data which can fit in Attributes. There's a good definition. Given enough of it, we could ask Arctos cool questions about horse teeth.
We could also push it to Media or structured data or similar, which would support the same questions but with a lot more work. ("unformatted measurements" is NOT a suitable target; these kinds of data are formatted.)
A "these things are Attributes" definition from the AWG would be very useful. (I'd probably vote to continue adding anything that fits and looking for solutions to the problems that causes, but I don't think this is my call.)
Here are existing Attributes by frequency of usage. Can anything that's not used much be removed or merged or ??
The text was updated successfully, but these errors were encountered: