Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Eskape case studies #82

Closed
wants to merge 3 commits into from
Closed

Conversation

Cateline
Copy link
Collaborator

@Cateline Cateline commented Oct 8, 2024

Description

What kind of change(s) are included?

  • Feature (adds or updates new capabilities)
  • Bug fix (fixes an issue).
  • Enhancement (adds functionality).
  • Breaking change (these changes would cause existing functionality to not work as expected).

Checklist

Please ensure that all boxes are checked before indicating that this pull request is ready for review.

  • [x ] I have read and followed the CONTRIBUTING.md guidelines.
  • [ x] I have searched for existing content to ensure this is not a duplicate.
  • [x ] I have performed a self-review of these additions (including spelling, grammar, and related).
  • [x ] I have added comments to my code to help provide understanding.
  • I have added a test which covers the code changes found within this PR.
  • [ x] I have deleted all non-relevant text in this pull request template.
  • [ x] Reviewer assignment: Tag a relevant team member to review and approve the changes.
    @jananiravi @the-mayer

This file contains essential data related to antibiotic resistance genes curated from the Comprehensive Antibiotic Resistance Database (CARD).
@jananiravi
Copy link
Member

Assigned this to @AbhirupaGhosh (primary) and @epbrenner @the-mayer (secondary).

Abhirupa/Evan/David, along with the script, could you also check if this is the right file format we want to use? Thanks!

@AbhirupaGhosh
Copy link

AbhirupaGhosh commented Oct 9, 2024

Refer to this comment for guidance.

Title: Process CARD Data, Map Short Names, and Run MolEvolveR

  • Download CARD Data: Retrieve the latest CARD dataset. (DOWNLOAD)
  • Open ARO_index.tsv: Parse the file (in R).
  • Map CARD Short Name: Map the CARD Short Name column to shortname_antibiotics.tsv and shortname_pathogens.tsv. The CARD Short Name values follow the format pathogen_gene or pathogen_gene_drug.
  • Sort and Group the data by pathogens and antibiotics.
  • Filter Favorite Bug-Drug or Bug for further analysis.
  • Download FASTA Sequences for the list of protein accessions filtered. (use Entrez)
  • Run MolEvolvR: Run the protein sequences through the MolEvolvR tool for evolutionary analysis.

@Cateline Cateline closed this Oct 11, 2024
@Cateline Cateline deleted the eskape-case-studies branch October 11, 2024 05:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants