Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix non-deterministic errors in GetSampleIdsFromVcfTar #531

Merged
merged 1 commit into from
May 15, 2023

Conversation

mwalker174
Copy link
Collaborator

GetSampleIdsFromVcfTar occasionally fails with an error about the piping strategy used. This PR simplifies the code that uses more disk but is safer and still fast.

@mwalker174 mwalker174 requested a review from cwhelan May 12, 2023 15:11
Copy link
Member

@cwhelan cwhelan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me so long as you don't think the disk requirement will get out of hand. Easier to read this way too.

@mwalker174 mwalker174 merged commit c3a11f2 into main May 15, 2023
@mwalker174 mwalker174 deleted the mw_get_sample_ids_from_tar_fix branch May 15, 2023 16:37
gatk-sv-bot pushed a commit to Genometric/gatk-sv that referenced this pull request Jun 7, 2023
gatk-sv-bot pushed a commit to Genometric/gatk-sv that referenced this pull request Jun 27, 2023
gatk-sv-bot pushed a commit to Genometric/gatk-sv that referenced this pull request Jun 27, 2023
MattWellie added a commit to populationgenomics/gatk-sv that referenced this pull request Jul 26, 2023
* Revert to CPG commit ca70123

* Absorb Broad upstream changes:

* Add handling for Flag vcf fields to vcf_to_pandas (broadinstitute#506)
* add outlier samples list & count outputs to PlotSVCountsPerSample (broadinstitute#510)
* eliminate cram to bam conversion when possible (broadinstitute#468)
* Add ref panel inputs for MakeCohortVcf subworkflows (broadinstitute#517)
* Extend STR workflow to collect additional locus-level metrics (broadinstitute#516)
* Change ref allele to N if unsupported in gatk formatting script (broadinstitute#511)
* Add sample renaming for SD files in GatherBatchEvidence (broadinstitute#519)
* Remove vcf header contig sorting in CleanVcf5 (broadinstitute#518)
* Add support for building dockers for multiple registries. (broadinstitute#507)
* Remove non-public images from the git-sha-based target determination (broadinstitute#525)
* Fix ReadMe for build docker (broadinstitute#528)
* Fix for tiny shard of IntegrateGQ in single sample pipeline (broadinstitute#524)
* Fix issue in Genotype batch script with low PE_log_pval causing zero PE_count (broadinstitute#527)
* Manually install MASS R package in sv-pipeline-virtual-env (broadinstitute#534)
* Fix non-deterministic errors in GetSampleIdsFromVcfTar (broadinstitute#531)
* Add ApplyManualVariantFilter json template (broadinstitute#536)
* make the RunMELT task a little more robust (broadinstitute#529)
* Improve STR workflow (broadinstitute#539)
* Fix workspace format (Generate Terra workspace tsv files from transposed tables) (broadinstitute#523)
* Update VaPoR workflows (broadinstitute#532)
* Update README to link to SV callers used. (broadinstitute#541)

Co-authored-by: Mark Walker <markw@broadinstitute.org>
Co-authored-by: epiercehoffman <epierceh@broadinstitute.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants