-
Notifications
You must be signed in to change notification settings - Fork 743
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update modules in GATK's gcnvcaller pipeline #3561
update modules in GATK's gcnvcaller pipeline #3561
Conversation
tuple val(meta2), path(fai) | ||
tuple val(meta2), path(dict) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tuple val(meta2), path(fai) | |
tuple val(meta2), path(dict) | |
tuple val(meta3), path(fai) | |
tuple val(meta4), path(dict) |
@@ -24,11 +24,16 @@ input: | |||
description: | | |||
Groovy Map containing sample information | |||
e.g. [ id:'test', single_end:false ] | |||
- bam: | |||
- meta2: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add the other metas too?
process GATK4_DETERMINEGERMLINECONTIGPLOIDY { | ||
tag "$meta.id" | ||
label 'process_single' | ||
|
||
//Conda is not supported at the moment: https://github.com/broadinstitute/gatk/issues/7811 | ||
container "nf-core/gatk:4.4.0.0" //Biocontainers is missing a package | ||
container "quay.io/nf-core/gatk:4.4.0.0" //Biocontainers is missing a package |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
container "quay.io/nf-core/gatk:4.4.0.0" //Biocontainers is missing a package | |
container "nf-core/gatk:4.4.0.0" //Biocontainers is missing a package |
Quay.io is the default registry in all nf-core pipelines so we leave this out for more flexibility
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When I tried without it, singularity fails to pull the image. https://github.com/nf-core/modules/actions/runs/5394391319/jobs/9795497440
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@adamrtalbot can you help with this? :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's fixed in nf-core/tools#2336 but will require a new release.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also fixed with Nextflow version 23.04+ which includes singularity.registry
, which is set to quay.io
in the NF-Core template.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome thank you!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Cool! I will let quay.io
be a part of the uri for now, but I will remove it after the next tools release 😄
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Caused issue: #3668
@@ -3,7 +3,7 @@ process GATK4_GERMLINECNVCALLER { | |||
label 'process_single' | |||
|
|||
//Conda is not supported at the moment: https://github.com/broadinstitute/gatk/issues/7811 | |||
container "nf-core/gatk:4.4.0.0" //Biocontainers is missing a package | |||
container "quay.io/nf-core/gatk:4.4.0.0" //Biocontainers is missing a package |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
container "quay.io/nf-core/gatk:4.4.0.0" //Biocontainers is missing a package | |
container "nf-core/gatk:4.4.0.0" //Biocontainers is missing a package |
path model | ||
path ploidy | ||
tuple val(meta2), path(model) | ||
tuple val(meta2), path(ploidy) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tuple val(meta2), path(ploidy) | |
tuple val(meta3), path(ploidy) |
Can you also fix the meta.yml here?
@@ -3,36 +3,33 @@ process GATK4_POSTPROCESSGERMLINECNVCALLS { | |||
label 'process_single' | |||
|
|||
//Conda is not supported at the moment: https://github.com/broadinstitute/gatk/issues/7811 | |||
container "nf-core/gatk:4.4.0.0" //Biocontainers is missing a package | |||
container "quay.io/nf-core/gatk:4.4.0.0" //Biocontainers is missing a package |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
container "quay.io/nf-core/gatk:4.4.0.0" //Biocontainers is missing a package | |
container "nf-core/gatk:4.4.0.0" //Biocontainers is missing a package |
tuple val(meta), path(ploidy) | ||
path model | ||
path calls | ||
tuple val(meta), path(ploidy, stageAs:'ploidy') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do you use stageAs
here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ploidy
and calls
are generated by other modules upstream and GATK attaches the same suffix (-calls
) to their names, and if someone running the cnvcalling workflow doesn't customize the prefixes the names will clash. That's why I have used stageAs here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah we had long discussions about this in the past and decided to not use stageAs
in these cases and force the user to use different prefixes (which is a best practice to do anyway)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, I see.. Alright I will change it 😄
path calls | ||
tuple val(meta), path(ploidy, stageAs:'ploidy') | ||
tuple val(meta2), path(model) | ||
tuple val(meta2), path(calls) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tuple val(meta2), path(calls) | |
tuple val(meta3), path(calls) |
Same here
Thanks for the review @nvnieuwk |
PR checklist
This PR updates the following modules,
Major change here is the removal of tarred inputs and outputs for modules 2, 3, and 4. Reason being that the modules are used as a part of gatk's cnvcalling workflow, and compressing the output from one module only to uncompress it in the next step is a futile exercise.
versions.yml
file.label
PROFILE=docker pytest --tag <MODULE> --symlink --keep-workflow-wd --git-aware
PROFILE=singularity pytest --tag <MODULE> --symlink --keep-workflow-wd --git-aware
PROFILE=conda pytest --tag <MODULE> --symlink --keep-workflow-wd --git-aware