-
Notifications
You must be signed in to change notification settings - Fork 9
Species assignment using targeted amplicon sequencing datasets
An amplicon panel and a whole species assignment method were developed to perform species assignment for the entire genus of Anopheles mosquitoes. k-mers provide an objective way to compare highly diverged sequences, where multiple sequence alignment or alignment to a single reference genome tends to introduce bias towards better-represented clades in the panel and the reference species respectively. Moreover, k-mers provide a natural way to incorporate small indels in addition to SNPs, which considerably increases the power to distinguish between species when working with less than 10kb sequence.
To use the anopheles dataset and perform species identification based on k-mer pairwise comparisons, use the jupyter notebook that includes all the guidelines for this exercise.
Introduction
k-mer spectra analysis
- 📖 Introduction to K-mer spectra analysis
- 📖 Basics of genome modeling
- ⚒ manual model fitting (for better understanding of the underlying model)
- ⚒ simple diploid
- ⚒ demonstrating the effect of sequencing error rate on k-mer coverage
- 📖 Common difficulties in characterisation of diploid genomes using k mer spectra analysis
- ⚒ low coverage (pitfall) - to be merged
- ⚒ very homozygous diploid
- ⚒ highly heterozygous diploid
- ⚒ Genome size of a repetitive genome (pitfall)
- ⚒ Wrong ploidy (pitfall)
- 📖 Characterization of polyploid genomes using k mer spectra analysis
- ⚒ Autotetraploid
- ⚒ Allotetraploid
- ⚒ Estimating ploidy (smudgeplot)
- 📖 Genome modeling as a quality control
- ⚒ Contamination (pitfall)
- ⚒ k-mers in an assembly (Mercury/KAT)
- 📖 Analysing genome skimming data
Separation of chromosomes
- 📖Separate sub-genomes of an allopolyploid
- 📖Separating chromosomes by comparison of sequencing libraries
Species assignment using short k-mers
Others