Symptom Extraction Demo

Dataset description

The datasets required for the demo are available in the data folder. They are based on simulated medical conversations from "A dataset of simulated patient-physician medical interviews with a focus on respiratory cases" (Fareez, F. et al., 2022).

The dataset is a .csv file with at least two columns:

text: the transcript of the patient-doctor interaction.
label: the expected output. For symptom tracking (binary case), the label should be 'Positive' if any symptom is mentioned in the text, and 'Negative' otherwise. For symptom extraction (multi-label case), the label should be a semicolon-separated string of symptoms (for example, fever;other;trouble drinking fluids).
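
As a rough illustration of the two label formats described above, the following sketch reads a CSV with the text and label columns and parses the labels. The helper names (parse_multilabel, check_binary) and the sample row are hypothetical and are not taken from this repository or from the real dataset; the sketch uses only the Python standard library.

```python
import csv
import io

# Allowed values for the binary (symptom tracking) case, as described above.
BINARY_LABELS = {"Positive", "Negative"}


def parse_multilabel(label: str) -> list[str]:
    """Split a semicolon-separated label into a list of symptom names."""
    return [part.strip() for part in label.split(";") if part.strip()]


def check_binary(label: str) -> str:
    """Return the label unchanged if it is a valid binary label."""
    if label not in BINARY_LABELS:
        raise ValueError(f"unexpected binary label: {label!r}")
    return label


# Hypothetical row for illustration only; real files live in the data folder.
sample = io.StringIO(
    "text,label\n"
    '"P: I have had a fever, and drinking is hard.",fever;trouble drinking fluids\n'
)

rows = list(csv.DictReader(sample))
symptoms = parse_multilabel(rows[0]["label"])
print(symptoms)
```

To read a real file instead, the io.StringIO object can be replaced with an open file handle pointing at a CSV in the data folder.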

The dataset can include additional metadata as extra columns. In this case, you'll find the source column, which holds the ID of the transcript from which the text segment was taken.
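
Because several text segments can share the same source ID, it may be useful to group segments by transcript. The sketch below shows one way to do that with the standard library; the rows and the transcript IDs (T001, T002) are made up for illustration and do not come from the real dataset.

```python
import csv
import io
from collections import defaultdict

# Hypothetical rows with the extra "source" metadata column.
sample = io.StringIO(
    "text,label,source\n"
    "first segment,Positive,T001\n"
    "second segment,Negative,T001\n"
    "third segment,Positive,T002\n"
)

# Map each transcript ID to the list of text segments taken from it.
segments_by_source = defaultdict(list)
for row in csv.DictReader(sample):
    segments_by_source[row["source"]].append(row["text"])

for source_id, segments in segments_by_source.items():
    print(source_id, len(segments))
```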

License

This repository is made available under the terms of the GNU General Public License version 2. You are free to use, modify, and distribute the code under these terms.

For those interested in using this project under a different licensing arrangement, commercial license options are also available. Please contact charlotta_lindvall@dfci.harvard.edu for more information.

