Skip to content

Partitioning and analysis methods for large, complex sequence datasets

License

Notifications You must be signed in to change notification settings

qtleeq/LatentStrainAnalysis

 
 

Repository files navigation

Welcome to the Latent Strain Analysis (LSA) code repository!

LSA was developed as a pre-assembly tool for partitioning metagenomic reads. It uses a hyperplane hashing function and streaming SVD in order to find covariance relations between k-mers. The code, and the process outline in LSFScripts in particular, have been optimized to scale to massive data sets in fixed memory with a highly distributed computing environment.

Documentation

Documentation for LSA, including a "getting started" tutorial with accompanying test data, and step-by-step instructions for analyzing large collections, can be found at: http://latentstrainanalysis.readthedocs.org/

The lastest version of SciPy is broken for Python 2

These changes are needed to get SciPy functioning again.

About

Partitioning and analysis methods for large, complex sequence datasets

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 95.0%
  • Shell 5.0%