Flye assembler (successor of ABruijn)

Version: 2.3.1

Flye is a de novo assembler for long and noisy reads, such as those produced by PacBio and Oxford Nanopore Technologies. The algorithm uses an A-Bruijn graph to find the overlaps between reads and does not require them to be error-corrected. After the initial assembly, Flye performs an extra repeat classification and analysis step to improve the structural accuracy of the resulting sequence. The package also includes a polisher module, which produces the final assembly of high nucleotide-level quality.

New in version 2.3

ABruijn 2.x branch has been renamed to Flye, highlighting many substantial algorithmic changes
Stable version of the repeat analysis module
New command-line syntax (fallback mode with the old syntax is available)
New --subassemblies mode for generating consensus of multiple assemblies
Improved preformance and reduced memory footprint (now scales to human genome)
Corrected reads are now supported
Extra output with information about the contigs (coverage, multiplicity, graph paths etc.)
Gzipped Fasta/q support
Multiple read files support

Manuals

Assembly graph

The Flye algorithms are operating on the assembly (repeat) graph. The edges in this graph represent genomic sequences, and nodes simply serve as junctions. The genoimc chromosomes traverse this graph (in an unknown way) so as each unique edge is covered exactly once. The genomic repeats that were not resolved are collapsed into the corresponding edges in the graph (therefore genome structure remain umbigious).

An example of a final assembly graph of a bacterial genome is above. Each edge is labeled with its id, length and coverage. Repetitive edges are shown in color, while unique edges are black. The clusters of adjacent repeats are shown with the same color. Note that each edge is represented in two copies: forward and reverse complement (marked with +/- signs), therefore the entire genome is represented in two copies as well. Sometimes (as in this example), forward and reverse-complement components are clearly separated, but often they form a single connected component (in case if the genome contain unresolved inverted repeats).

In this example, there are two unresolved repeats: (i) a red repeat of multiplicity two and length 35k and (ii) a green repeat cluster of multiplicity three and length 34k - 36k. As the repeats remained unresolved, there are no reads in the dataset that cover those repeats in full.

Third-party

Flye package includes some third-party software:

License

Flye is distributed under a BSD license. See the LICENSE file for details.

Credits

Flye was developed in Pavel Pevzner's lab at UCSD

Code contributions:

Original assembler code: Yu Lin
Original polisher code: Jeffrey Yuan
Repeat graph and current package support: Mikhail Kolmogorov

Publications

Mikhail Kolmogorov, Jeffrey Yuan, Yu Lin and Pavel Pevzner, "Assembly of Long Error-Prone Reads Using Repeat Graphs", bioRxiv, 2018

Yu Lin, Jeffrey Yuan, Mikhail Kolmogorov, Max W Shen, Mark Chaisson and Pavel Pevzner, "Assembly of Long Error-Prone Reads Using de Bruijn Graphs", PNAS, 2016

Contacts

Please report any problems directly to the github issue tracker. Also, you can send feedback to fenderglass@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 622 Commits
bin		bin
docs		docs
flye		flye
lib		lib
src		src
.gitignore		.gitignore
.ycm_extra_conf.py		.ycm_extra_conf.py
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Flye assembler (successor of ABruijn)

Version: 2.3.1

New in version 2.3

Manuals

Assembly graph

Third-party

License

Credits

Publications

Contacts

About

Releases

Packages

Languages

License

bioluria/Flye

Folders and files

Latest commit

History

Repository files navigation

Flye assembler (successor of ABruijn)

Version: 2.3.1

New in version 2.3

Manuals

Assembly graph

Third-party

License

Credits

Publications

Contacts

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages