This repository has been archived by the owner on Apr 19, 2023. It is now read-only.

[BUG] scenic multirun stalls during Arboreto with multiprocessing #375

Open
Jay-Leung opened this issue Mar 14, 2022 · 0 comments
Labels
bug Something isn't working


Describe the bug
I was previously using pySCENIC to identify regulons in my scRNA-seq data. To check the robustness of the regulons identified, I switched to VSN scenic with multiruns. The example given in the documentation works with nRuns = 2. However, with my own dataset and nRuns = 100, the pipeline stalls (once at 21%, another time at 64%) during the Arboreto with multiprocessing step. CPU usage drops to 0%, so it does not seem to be running, even though no error is shown. I apologise that I cannot include a screenshot: I restarted the process with -resume, and it now seems to have skipped the Arboreto step that stalled at 64% and moved on to the next step (add Pearson correlation).

I wondered whether this had to do with the compute resources in the config file, so I increased some of the parameters. I am quite new to this and not sure which exact parameter to adjust, so could you kindly take a look at my config file? I am running on a local workstation with 16 cores/32 threads and 120 GB RAM.

To Reproduce
Steps to reproduce the behavior:

  1. Configure with these options:
nextflow config \
-profile hg38,scenic,scenic_multiruns,loom,scenic_use_cistarget_motifs,scenic_use_cistarget_tracks,singularity \
vib-singlecell-nf/vsn-pipelines > nf.config
  2. Run using this entry point:
NXF_VER=21.04.3 nextflow -C nf.config run vib-singlecell-nf/vsn-pipelines -entry scenic -r v0.27.0
  3. See error:
    Stalls at a certain percentage, with 0% CPU usage. I have to press Ctrl+C to force it to stop.

Expected behavior
Process should continue.

Please complete the following information:

  • OS: Ubuntu 20.04.4 LTS
  • Nextflow Version: 21.04.3
  • vsn-pipelines Version: 0.27.0

Additional context
config.txt
execution_trace.txt
[angry_roentgen] Nextflow Workflow Report.pdf
execution_timeline.pdf

I have attached the config file and execution trace as .txt files, and the report and timeline as PDF files. The 65th Arboreto run stalled for more than 5 hours before I pressed Ctrl+C to force it to stop. I would appreciate some guidance on the compute resource parameters. Specifically, does cpus in the config refer to the number of physical cores, or does it count threads as well?
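
To make the cores-versus-threads question concrete, here is a minimal Python sketch (not taken from vsn-pipelines or Nextflow) that reports how many logical CPUs the operating system exposes and how many tasks of a given size would fit side by side. The helper names are hypothetical, and the idea that a local scheduler budgets against logical CPUs (hardware threads) rather than physical cores is an assumption, not something confirmed in this thread.

```python
import os


def logical_cpus() -> int:
    """Logical CPUs (hardware threads) reported by the OS; 1 if unknown."""
    return os.cpu_count() or 1


def usable_cpus() -> int:
    """CPUs this process is allowed to run on.

    os.sched_getaffinity exists on Linux only; elsewhere fall back to
    the logical CPU count.
    """
    try:
        return len(os.sched_getaffinity(0))
    except AttributeError:
        return logical_cpus()


def max_parallel_tasks(total_cpus: int, cpus_per_task: int) -> int:
    """How many tasks requesting cpus_per_task each fit into total_cpus.

    Hypothetical helper: assumes a scheduler that simply divides a fixed
    CPU budget among tasks.
    """
    if cpus_per_task < 1:
        raise ValueError("cpus_per_task must be at least 1")
    return total_cpus // cpus_per_task


if __name__ == "__main__":
    print("logical CPUs:", logical_cpus())
    print("usable CPUs:", usable_cpus())
    # Example budget: a 32-thread machine with 8 CPUs requested per task.
    print("tasks in parallel:", max_parallel_tasks(32, 8))
```

On a 16-core/32-thread workstation, os.cpu_count() would be expected to report 32, since it counts hardware threads rather than physical cores.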

Thank you so much!

@Jay-Leung Jay-Leung added the bug Something isn't working label Mar 14, 2022