Softcite

Studying how researchers use, cite and collaborate around software

Softcite README page

Softcite is a project to improve the visibility of research software. We produce datasets, software, and papers.

Datasets

Extracted software mentions from publications

Manually annotated gold-standard dataset of software mentions

Tools

Mention Extraction Toolchain

Go from a folder of PDFs to extracted full text in XML, annotated with software mentions.

Browser for Extractions

We provide infrastructure for building a website that lets users browse a database created from software extractions.

A demonstration, populated with a small set of extractions, is available at https://cloud.science-miner.com/software_kb/frontend/index.html

Papers

  • Du, C., Cohoon, J., Lopez, P., & Howison, J. (2022). Understanding progress in software citation: a study of software citation in the CORD-19 corpus. PeerJ Computer Science, 8, e1022. https://doi.org/10.7717/peerj-cs.1022

  • Lopez, P., Du, C., Cohoon, J., Ram, K., & Howison, J. (2021). Mining Software Entities in Scientific Literature: Document-level NER for an Extremely Imbalance and Large-scale Task. Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 3986–3995. https://doi.org/10.1145/3459637.3481936

  • Du, C., Cohoon, J., Lopez, P., & Howison, J. (2021). Softcite dataset: A dataset of software mentions in biomedical and economic research publications. Journal of the Association for Information Science and Technology, 72(7), 870–884. https://doi.org/10.1002/asi.24454

Associated Papers

Repositories

  • software-mentions (Public)

    Softcite software mention recognizer, finding mentions and citations to software from within the academic literature

    JavaScript, Apache-2.0, 65 stars, 11 forks, 7 open issues, 2 open pull requests, updated Sep 14, 2024

  • .github (Public)

    0 stars, 0 forks, 0 open issues, 0 open pull requests, updated Sep 11, 2024

  • software_mentions_client (Public)

    A Python client for the Softcite software mention recognizer server

    Python, Apache-2.0, 5 stars, 2 forks, 2 open issues, 0 open pull requests, updated Jan 7, 2024

  • tutorials (Public)

    Tutorials for the Softcite tools

    0 stars, 1 fork, 0 open issues, 0 open pull requests, updated Sep 8, 2023

  • softcite_dataset_v2 (Public)

    New version of the gold-standard dataset of software mentions in research publications

    Python, 3 stars, 0 forks, 0 open issues, 0 open pull requests, updated Jun 1, 2023

  • softcite_kb (Public)

    A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources

    JavaScript, Apache-2.0, 15 stars, 0 forks, 10 open issues, 0 open pull requests, updated May 14, 2023

  • mentions_pipeline_notebook (Public)

    Notebook demonstrating the end-to-end usage of the Softcite software mention recognizer

    Jupyter Notebook, 0 stars, 2 forks, 2 open issues, 0 open pull requests, updated Mar 8, 2023
