-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with pplacer #256
Comments
Hi. Can you send us the information in |
Hi there, Running pplacer v1.1.alpha19-0-g807f6f3 analysis on gtdbtk_output/align/gtdbtk.bac120.user_msa.fasta... I have ~128Gb, is this sufficient? It worked for me before using this. Thanks, |
It should be enough memory. From the output it looks like you might have run it using 30 CPUs. Can you try running it with a single CPU? |
Hi there, I tried that already and got the same error. I also re downloaded it to see if it would make any difference but it didn’t work either. |
Hello, typically errors that are encountered during the 'caching likelihood' step of pplacer indicates that there's not enough memory. This is the step where pplacer starts consuming the 100GB+ of memory. If possible, keep an eye on what is using the memory on the system while the classify step is running. I would hazard a guess that there could be something else using the memory. As an alternative, pplacer has an mmap flag (maps the memory to disk to try and reduce the memory burden), this can be accessed using the scratch_dir command. |
I'm currently running into an issue where pplacer is running out of memory despite using The command I'm using is :
The full output log is attached, but the error that's killing it is:
Update: |
@btemperton It sounds like you've done everything right to get pplacer working but why it's out of memory is not obvious. There are certainly some quirks when using pplacer, e.g. >64 threads will cause it to hang. The issue ultimately is with pplacer, unfortunately. However, I do have an idea to try and get it working on your system:
e.g. this is my output:
Thanks. |
Closing due to inactivity. |
Hi all, I use a SLURM system and here are the job stats:
It's failing at the same pplacer step when trying to place the MAG. Any insights will be greatly appreciated. Thank you! |
Hi there,
I have used this program 5 times before and I never had an issue, however, I am now getting the following error:
**(gtdbtk) abreen:[gtdbtk]$ gtdbtk classify_wf --genome_dir all_genomes --extension fasta --out_dir gtdbtk_output
[2020-05-27 15:41:27] INFO: GTDB-Tk v1.1.1
[2020-05-27 15:41:27] INFO: gtdbtk classify_wf --genome_dir all_genomes --extension fasta --out_dir gtdbtk_output
[2020-05-27 15:41:27] INFO: Using GTDB-Tk reference data version r89: /home/.conda/envs/gtdbtk/share/gtdbtk-1.1.1/db/
[2020-05-27 15:41:27] INFO: Identifying markers in 24 genomes with 1 threads.
[2020-05-27 15:41:27] INFO: Running Prodigal V2.6.3 to identify genes.
==> Finished processing 24 of 24 (100.0%) genomes.
[2020-05-27 15:48:47] INFO: Identifying TIGRFAM protein families.
==> Finished processing 24 of 24 (100.0%) genomes.
[2020-05-27 15:51:45] INFO: Identifying Pfam protein families.
==> Finished processing 24 of 24 (100.0%) genomes.
[2020-05-27 15:52:05] INFO: Annotations done using HMMER 3.1b2 (February 2015)
[2020-05-27 15:52:06] INFO: Done.
[2020-05-27 15:52:07] INFO: Aligning markers in 24 genomes with 1 threads.
[2020-05-27 15:52:07] INFO: Processing 24 genomes identified as bacterial.
[2020-05-27 15:52:29] INFO: Read concatenated alignment for 23458 GTDB genomes.
==> Finished aligning 24 of 24 (100.0%) genomes.
[2020-05-27 15:58:23] INFO: Masking columns of multiple sequence alignment using canonical mask.
[2020-05-27 15:59:03] INFO: Masked alignment from 41155 to 5040 AAs.
[2020-05-27 15:59:03] INFO: 0 user genomes have amino acids in <10.0% of columns in filtered MSA.
[2020-05-27 15:59:03] INFO: Creating concatenated alignment for 23482 GTDB and user genomes.
[2020-05-27 15:59:03] INFO: Creating concatenated alignment for 24 user genomes.
[2020-05-27 15:59:03] INFO: Done.
[2020-05-27 15:59:03] INFO: Placing 24 bacterial genomes into reference tree with pplacer using 30 cpus (be patient).
Initialising pplacer [Caching likelihood information on reference tree..]
[2020-05-27 16:10:50] ERROR: Controlled exit resulting from an unrecoverable error or warning.
================================================================================
EXCEPTION: PplacerException
MESSAGE: An error was encountered while running pplacer, check the log file: gtdbtk_output/classify/intermediate_results/pplacer/pplacer.bac120.out
Traceback (most recent call last):
File "/home/.conda/envs/gtdbtk/bin/gtdbtk", line 492, in
File "/home/.conda/envs/gtdbtk/lib/python3.8/site-packages/gtdbtk/main.py", line 666, in parse_options
File "/home/.conda/envs/gtdbtk/lib/python3.8/site-packages/gtdbtk/main.py", line 426, in classify
File "/home/.conda/envs/gtdbtk/lib/python3.8/site-packages/gtdbtk/classify.py", line 351, in run
File "/home/.conda/envs/gtdbtk/lib/python3.8/site-packages/gtdbtk/classify.py", line 168, in place_genomes
File "/home.conda/envs/gtdbtk/lib/python3.8/site-packages/gtdbtk/external/pplacer.py", line 154, in run
gtdbtk.exceptions.PplacerException: An error was encountered while running pplacer, check the log file: gtdbtk_output/classify/intermediate_results/pplacer/pplacer.bac120.out**
Would you be able to help me please? I would really appreciate it.
Thank you.
Kind Regards,
A Breen
The text was updated successfully, but these errors were encountered: