4.beta.3
Pre-releaseThis release contains a number of bug fixes and improvements. Highlights include a fix for intermittent failures/timeouts when accessing data in Google Cloud Storage
(GCS), new and improved active-region detection for Mutect2
, and a new VariantRecalibrator
argument to allow the tool to scale better. See the full list of changes below. Most of the major known issues listed in the release notes for 4.beta.1 still apply, with the exception of the "intermittent GCS failures/timeouts" issue, which is now resolved.
A docker image for this release can be found in the broadinstitute/gatk repository on dockerhub. Within the image, cd into /gatk
then run gatk-launch
commands as usual.
Note: Due to our current dependency on a snapshot of google-cloud-java
, this release cannot be published to maven central.
Changes in this release:
GATK engine
: Move togoogle-cloud-java
snapshot with more robust retries, and set number of retries/reopens globally. This fixes the intermittent "all retries/reopens failed" error when accessing data on GCS (Google Cloud Storage). See issue #2749Mutect2
: Implemented a new algorithm for active-region detection, reducing spurious active regions by almost 50%Mutect2
: Filter artifacts that arise from apparent-duplicate readsMutect2 WDL
:Oncotator
is now being told the case and control sample names explicitly in the WDL. The Oncotator code for inferring this could yield incorrect answers in some cases. See issue #3343FilterByOrientationBias
: We discovered that it is impossible to guarantee a FDR threshold of all the variants when one artifact mode had high oxoQ and the other had low. We have changed the tool to guarantee the FDR threshold within each artifact mode, rather than for all variants. For more details, see issue #3344FilterByOrientationBias
: Summary table was not being populated properly. That has been fixed. See issue #3309VariantRecalibrator
: Add argument to pre-sample data for VQSR model building (and also recalibration) to reduce memory usage for production pipeline. See issue #3230- Fix a stack overflow issue at high depths in the strand artifact annotation. See issue #3317
GenomicsDBImport
: add--readerThreads
argument for multi-threaded vcf pre-loading. Improves performance of the tool by ~30% in our tests.ValidateVariants
: port gvcf validation option from GATK3- Polish up
PathSeq
and add pipeline tool - Fix error message describing how to set the
GATK_STACKTRACE_ON_USER_EXCEPTION
property Mutect2FilteringEngine
: correctMEDIAN_BASE_QUALITY_DIFFERENCE_FILTER
andMEDIAN_MAPPING_QUALITY_DIFFERENCE_FILTER
filter namesMutect2 WDL
: gaveProcessOptionalArguments
a leaner dockerGATK4 Docker Image
: changed the landing directory for the docker image to be/gatk
instead of/root
Travis CI
: fixed test report not being uploaded to GCSTravis CI
: removed non-docker unit and integration tests, which were redundant