Hi Ryan,
Unicycler 0.4.8 failed to score most of the k-mer graphs when I ran a hybrid assembly. The file unicycler.log says:
SPAdes assemblies (2020-01-24 11:02:57)
---------------------------------------
Unicycler now uses SPAdes to assemble the short reads. It scores the assembly graph for each k-mer using the number of contigs (fewer is better) and the number of dead ends (fewer is better). The score function is 1/(c*(d+2)), where c is the contig count and d is the dead end count.
K-mer   Contigs   Dead ends   Score
   21                         failed
   35                         failed
   47                         failed
   57                         failed
   67                         failed
   73                         failed
   81                         failed
   85                         failed
   91                         failed
   95       177          39   1.38e-04   ← best
Read depth filter: removed 2 contigs totalling 459 bp
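As a sanity check, the score reported for k = 95 can be reproduced from the formula quoted in the log. This is a minimal sketch: the function name `kmer_graph_score` is my own and is not taken from Unicycler's source code.

```python
def kmer_graph_score(contig_count: int, dead_end_count: int) -> float:
    """Score as described in the log text: 1 / (c * (d + 2)); higher is better."""
    return 1.0 / (contig_count * (dead_end_count + 2))


# Values from the k = 95 row of the log: 177 contigs, 39 dead ends.
score = kmer_graph_score(177, 39)
print(f"{score:.2e}")  # 1.38e-04
```

So the k = 95 row is internally consistent; the problem is that it is the only row that could be scored at all.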
Despite these failures, Unicycler finished the assembly job and produced assembly graphs. Nevertheless, I am concerned about this issue, as it means Unicycler cannot determine which k-mer graph is actually the best. Here is a complete list of the dependencies I installed using bioconda:
SPAdes 3.14.0 (the latest release)
Python 3.7.3
Racon 1.4.10, Pilon 1.23
Bowtie 2.3.5, Clang 9.0.1, SAMtools 1.10, and BLAST+ 2.9.0
I looked into the directories K21, ..., K91, and K95 of the SPAdes output and found an assembly_graph.fastg file only in K95. The file structure of K21, ..., K91 was:
Since the function spades_assembly in spades_func.py reads assembly_graph.fastg in each k-mer folder, I believe that the absence of k-mer graphs in K21, ..., K91 causes the failures shown above.
To test my hypothesis, I ran SPAdes 3.14.0, 3.13.2, and 3.8.1 on the same set of paired-end reads (publicly available on NCBI SRA). With the exception of SPAdes 3.8.1, these versions generated assembly_graph.fastg only in the folder for the greatest k.
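The check described above can be scripted. The following is a minimal sketch that reports, for each K<number> subdirectory of a SPAdes output directory, whether it contains assembly_graph.fastg. The function name `kmer_dirs_with_graph` and the example path are hypothetical; only the file name and the K<number> directory naming come from the observations in this report.

```python
import re
from pathlib import Path


def kmer_dirs_with_graph(spades_dir):
    """Map each k value (from a K<number> subdirectory) to True/False,
    depending on whether that subdirectory holds assembly_graph.fastg."""
    result = {}
    for sub in Path(spades_dir).iterdir():
        match = re.fullmatch(r"K(\d+)", sub.name)
        if sub.is_dir() and match:
            graph = sub / "assembly_graph.fastg"
            result[int(match.group(1))] = graph.is_file()
    # Return the entries ordered by k for easier reading.
    return dict(sorted(result.items()))


# Example usage (the path is a placeholder, not a real location):
# for k, has_graph in kmer_dirs_with_graph("path/to/spades_output").items():
#     print(f"K{k}: {'found' if has_graph else 'missing'}")
```

Running this against the output of each SPAdes version makes the difference between versions easy to compare.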
In conclusion, as far as I have tested, Unicycler may not work properly with SPAdes 3.13.2 or later, because these SPAdes versions no longer produce an assembly_graph.fastg file for every k value. Could you help me check whether this is a real issue or just a problem with my system configuration? Thanks.
wanyuac changed the title from "The scoring function of Unicycler for k-mer graphs may not work for SPAdes 3.13 or later" to "The scoring function of Unicycler for k-mer graphs may not work for SPAdes 3.13.2 or later" on Jan 29, 2020.