-
Notifications
You must be signed in to change notification settings - Fork 311
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[ADAM-1358] Refactor BQSR to improve performance and legibility.
Resolves #1358. * Adds instrumentation to BQSR. * Changed SnpTable to remove RichVariant conversion, use VariantRDD API. * Refactoring SnpTable to eliminate per-residue costly masked site lookup. * Restructuring core of SnpTable around an array to improve GC performance. Additionally, wrote custom serializer to improve serialization performance. * Added test suite for SnpTable, to test table creation. * Refactored SnpTable to use an IntervalArray-like approach. This approach improves masked site lookup performance by 50%. * Added tests to SnpTableSuite to cover lookup case, and reenabled tests in BaseQualityRecalibrationSuite. * Adding unit test coverage to covariates * Revert "[ADAM-775] Allow all IUPAC codes in BQSR" This reverts commit 207eeba. * Pulled Seq allocation for base check out into an immutable set. * Rewrote dinuc covariate. 50% improvement in performance. * Rewrite main BQSR aggregate as reduce by key * Added tests to recalibrator, recalibration table. * Majorly refactors of BQSR tables. * Starting to factor out the QualityScore class * Refactoring CovariateKey to reduce size in memory * Eliminated `org.bdgenomics.adam.rich.DecadentRead` (partially resolves #577) * Refactor CovariateKey to store record group ID instead of record group name. * Removed `org.bdgenomics.adam.models.QualityScore`. * Split multi-class files into one class per file (excepting private classes) to improve navigability. * Scaladoc all the recalibrators! You get a scaladoc! And you get a scaladoc!
- Loading branch information
Showing
30 changed files
with
1,870 additions
and
1,204 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
90 changes: 0 additions & 90 deletions
90
adam-core/src/main/scala/org/bdgenomics/adam/models/QualityScore.scala
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.