Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add tellread #149

Merged
merged 51 commits into from
Dec 3, 2024
Merged

Add tellread #149

merged 51 commits into from
Dec 3, 2024

Conversation

charles-cowart
Copy link
Collaborator

@charles-cowart charles-cowart commented Aug 12, 2024

Changes and additions to support tellseq.

@coveralls
Copy link

coveralls commented Aug 12, 2024

Pull Request Test Coverage Report for Build 12113094029

Details

  • 501 of 692 (72.4%) changed or added relevant lines in 19 files are covered.
  • 2 unchanged lines in 2 files lost coverage.
  • Overall coverage decreased (-2.6%) to 80.265%

Changes Missing Coverage Covered Lines Changed/Added Lines %
sequence_processing_pipeline/tests/test_SeqCountsJob.py 44 45 97.78%
sequence_processing_pipeline/tests/test_TRIntegrateJob.py 39 40 97.5%
sequence_processing_pipeline/tests/test_TellReadJob.py 36 37 97.3%
sequence_processing_pipeline/ConvertJob.py 0 3 0.0%
sequence_processing_pipeline/tests/test_Job.py 69 73 94.52%
sequence_processing_pipeline/NuQCJob.py 13 18 72.22%
sequence_processing_pipeline/TellReadJob.py 54 68 79.41%
sequence_processing_pipeline/TRIntegrateJob.py 57 74 77.03%
sequence_processing_pipeline/SeqCountsJob.py 56 74 75.68%
sequence_processing_pipeline/Pipeline.py 44 65 67.69%
Files with Coverage Reduction New Missed Lines %
sequence_processing_pipeline/NuQCJob.py 1 60.07%
sequence_processing_pipeline/tests/test_NuQCJob.py 1 93.72%
Totals Coverage Status
Change from base Build 11807631277: -2.6%
Covered Lines: 2667
Relevant Lines: 3097

💛 - Coveralls

charles-cowart and others added 24 commits August 13, 2024 19:47
Version 2.0 of TellSeq support removes the master tellread.sh script and
the drop-in replacement TRConvertJob.py for Job()s that wrap individual
steps in the original script. These steps can be used in whole or in
part in varying order in the refactored SPP plugin (qp-klp).
Added optional parameter to Pipeline() class that overwrites the values in the lane column of a sample-sheet's data section.
This functionality used to reside in the qp-klp plugin and is a common usage pattern.
This allows SPP to override the value in a sample-sheet's lane column with the value provided by the user at submission time.
Fixes error found when post-processing adapter-trimmed fastq files.
All files were being moved into one of the project sub-folders, rather than into their associated folders.
This appears to be due to recent implementation change.
All files are now moved into their correct folder.
@charles-cowart charles-cowart changed the title WIP: Add tellread Add tellread Nov 21, 2024
@charles-cowart
Copy link
Collaborator Author

@antgonza There are 36 files to review, which is quite large, but a number are new files in the contrib directory that don't need reviewing. There's also new Jinja2 template files and they're fairly easy to read. Many files just contain small changes. There is also some new code to test processing squeue output and a simulated squeue python script.

README.md Outdated Show resolved Hide resolved
sequence_processing_pipeline/Commands.py Outdated Show resolved Hide resolved
sequence_processing_pipeline/TRNormCountsJob.py Outdated Show resolved Hide resolved
sequence_processing_pipeline/TellReadJob.py Show resolved Hide resolved
sequence_processing_pipeline/TellReadJob.py Show resolved Hide resolved
sequence_processing_pipeline/templates/cloudspades.sbatch Outdated Show resolved Hide resolved
sequence_processing_pipeline/templates/cloudspades.sbatch Outdated Show resolved Hide resolved
sequence_processing_pipeline/util.py Show resolved Hide resolved
Request from Antonio to make TRNormCountsJob more generalized for
current and upcoming work.
TRNormCountsJob replaced w/SeqCountsJob:
 * takes a list of paths to fastq and/or fastq.gz files.
 * runs seqtk to count sequences and bases in parallel.
 * aggregator code produces a json file of counts from log output.
@charles-cowart
Copy link
Collaborator Author

@antgonza Ready for review! Thanks again!

@antgonza antgonza merged commit b6fb6ff into biocore:master Dec 3, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants