-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
utgcns segfault at consensus step #247
Comments
Thanks for the detailed issue report, I think it might be related to marbl/canu@6a0d004 but if you're using verkko v2.0 that should be fixed. The bad fasta sequences would also be in the input sequences not the consensus output so you'd want to check your input hifi and ONT data for special characters rather than part023.fasta.WORKING. Can you upload this file to the ftp instead, https://canu.readthedocs.io/en/latest/faq.html#how-can-i-send-data-to-you, the dropbox download wasn't working. |
Hey @skoren - thanks for the swift reply, and sorry about the dropbox links not working. I uploaded the I will scan the input for abnormal bases and report back! EDIT: the "Thanks for the detailed issue report" really made my day 🙏 |
update: I ran the above Python script on my ONT and PacBio CCS fastas and found no abnormal characters. Really curious to find out what the problem could have been... |
Thanks, I got the files now and can reproduce the crash. I didn't notice this in your original post but I see you're running with I still want to fix the actual crash in the code but with the above you don't have to wait for me. One other quick question, how many lines are in your 7-consensus/ont_subset.id file? This is parameter to the consensus step so I want to make sure I'm fully consistent with your run. |
thanks for the heads up. I'd like to avoid compiling from source, so for now I'll try again without I'm closing the issue for now, might re-open if I stumble again. re: |
(possibly similar to #53 or Canu issues #1061 and #1073).
Expected behaviour
rule
generateConsensus
runs through and produces a final consensus fasta.Current behaviour
Error in rule. The log file in the
.snakemake/log/
dir says:log
When I look at the offending part, this is what I see:
part023.err
Possible solutions
long sequences
In issue #53 it was suggested that a read longer than 2^11 might be present. However, the offending line in canu has been fixed, so this can't be it.
abnormal FASTA
some canu issues suggested that unexpected characters in the FASTA file might be a problem. I looked through part023.fasta.WORKING with a short python script and couldn't find anything:
code
This runs through without any output. Manual inspection of (parts of) the FASTA file also did not show anything suspicious.
Steps to reproduce
I uploaded part023.csnpack and part023.fasta.WORKING on Dropbox.
Environment
HPC environment running Oracle Linux 9 (kernel release 5.14.0-362.24.1.0.1.el9_3.x86_64); verkko installation via
conda
. Theverkko
call was put in a wrapper script and ran on the head node of the cluster in--slurm
mode.verkko script
The text was updated successfully, but these errors were encountered: