-
Notifications
You must be signed in to change notification settings - Fork 169
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow CITATION.cff as alternative to Authors field in dataset_description #901
Comments
Do you know if CITATION.cff can include multiple citations? E.g., citing the versioned dataset and a data paper? |
i thought that is used for software only? ie we should have one in our BIDS repo |
created an example https://github.com/Remi-Gau/cff_example_data
Testing things here https://github.com/Remi-Gau/cff_example_software
|
Updated the software example to use the |
ok @Remi-Gau smarty pants you win :-)
|
@tsalo It seems you can have several identifiers. https://github.com/citation-file-format/citation-file-format/blob/main/schema-guide.md#identifiers |
Their schema does offer a few things we don't have. Could also allow a "division of labor": typical dataset info goes in CITATION.cff, BIDS specific info goes in dataset. This could also potentially better integrate with other non-BIDS tools and services (at the moment "only" github, zenodo, zotero). FYI I am not really convinced that this should be done. Just wanted to start a conversation to weight the pros and cons. (And advertise CFF files in case it could interest people for other things).
there is a python validator for those files and there a json schema already, that could be used for other validations From the BIDS perspective we would have to ensure consistency between dataset_description and those .cff files. |
My personal opinion on this is that we should wait how CITATION.CFF develops in the next months / year / years and then revive the discussion. If we see that it becomes very important and widespread (which I hope it does), we should officially adopt it. Until then, users can add it, and bids-ignore it ... as is already done for many BIDS datasets on GIN and the Until then, one could also write a |
yup I think that sums up why this is not a hill I want to die on just yet. |
but we could still use one inside https://github.com/bids-standard/bids-specification with all relevant publications :-) so it renders nice on github (ie we don't support it for datasets, but use it for the repo) |
Agreed! |
@christian-monch wrote one during a hackathon, I believe the most recent state of it can be found here :-) |
Had forgotten about this WIP while I started creating a package to streamline the creation of datacite.yml file for BIDS dataset... |
Yes.
The Authors list is just list of strings. There is a lot more nuance to authorship than just a name. Like a whole file-format's worth! And GitHub, Zenodo, and Zotero are supporting CITATION.cff. And there is a user-friendly tool to make CITATION.cff files.
I think this should be done. The pros seem to outweigh the cons.
It's been years and it looks good to me! |
CITATION.cff
files ?
In the schema, we would write a rule like: SingleSourceAuthors:
issue:
code: AUTHORS_AND_CITATION_FILE_MUTUALLY_EXCLUSIVE
level: error
message: |
CITATION.cff file found. The "Authors" field of dataset_description.json
should be removed to avoid inconsistency.
selectors:
- path == 'CITATION.cff'
checks:
- '!("Authors" in dataset_description)' I would not be inclined to also implement this in the legacy validator. Unfortunately, CFF does not have a Javascript validator, just Python. They do share JSON schemas though, so it wouldn't be awful to validate ourselves: https://github.com/citation-file-format/cff-converter-python/tree/main/cffconvert/schemas |
I agree this change would be very helpful for including more complete authorship information in BIDS datasets. It's an issue for OpenNeuro and a BIDS solution would let us add this to datasets in a way that allowed for reuse.
The CFF Initializer tool @ericearl mentioned has a simple JavaScript validator implementation. https://github.com/citation-file-format/cff-initializer-javascript/blob/main/src/store/validation.ts |
I had worked on a little package to help create citation files for bids datasets because they can also be ingested by datalad metadata tools. Having the citation file take precedence and not having to synch with the dataset description would make things even easier. |
Looking at https://github.com/citation-file-format/citation-file-format/blob/main/README.md, we have additional overlaps with
We may want to make more than just authors mutually exclusive with Also, authors have no |
Wait... Technically we only have bids version in dataset description and not version, right? |
Ah, sorry, I didn't actually look it up. I guess I was thinking of it being part of DOIs in many cases. |
Contribution roles will be included in the next release! |
I saw that and got all excited about it! |
Please see #1525 for proposed text and validation rules. |
CITATION.cff can be used for citing software or datasets.
Would it make sense to allow them officially in a BIDS dataset ? What do you all think?
Its content would be in part redundant with dataset_description and thus might require validation for internal consistency.
Links
https://citation-file-format.github.io/
https://docs.github.com/en/repositories/managing-your-repositorys-settings-and-features/customizing-your-repository/about-citation-files
https://docs.github.com/en/repositories/managing-your-repositorys-settings-and-features/customizing-your-repository/about-citation-files#citing-a-dataset
The text was updated successfully, but these errors were encountered: