Running replicates statistics on bins and bed files #80
Included a `bwstats.R` with functionality to compute statistics based on the `DESeq2` package for both bins and BED files.

Added functions: `bw_bed_diff_analysis`, `bw_bins_diff_analysis`. These operate on two lists of bigWig files, plus labels. Labels are mandatory at this point because I believe this makes the analysis more robust to mistakes like forgetting what was compared against what. So at this point you need to provide at least `"treated"` and `"untreated"`, or some other labels that identify the groups of bigWig files.

This is a work in progress. I still need to add some automated testing. Even though at this point I am not including tests of actual values (relying on `DESeq2` for that), I am going to include at least tests to check that it runs and that the parameters passed go where they are supposed to go.

Since we mostly operate on scaled bigWig files, the `estimateSizeFactors` step that is usually performed with `DESeq2` is overridden. A parameter `estimate_size_factors` is provided, though: when enabled, it runs the normal `DESeq2` function. I provide this for the cases where we are not looking at scaled bigWig files. The default value of this parameter is `FALSE`.

The return value of these functions is a results table like the one obtained when you call `DESeq2::results`, which means that for any analysis you still need to set some cutoff thresholds for p-value and/or fold change.
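For reviewers who want to see the size factor switch and the cutoff step in one place, here is a minimal sketch. It is not the code in `bwstats.R`: the helper `run_diff_sketch`, the simulated count matrix and the threshold values are all hypothetical, and fixing the size factors to 1 is only one assumed way of overriding `estimateSizeFactors`. It relies on `DESeq2::DESeq` reusing size factors that are already set on the object.

```r
library(DESeq2)

set.seed(1)

# Hypothetical stand-in for per-locus coverage: 200 loci x 4 samples.
# Real input would come from bigWig files; DESeq2 expects non-negative integers.
n_loci <- 200
mu <- matrix(rep(2^runif(n_loci, 4, 10), 4), ncol = 4)
mu[1:20, 1:2] <- mu[1:20, 1:2] * 4  # simulate 20 loci enriched in "treated"
counts <- matrix(rnbinom(length(mu), mu = mu, size = 10), ncol = 4)
colnames(counts) <- c("treated_1", "treated_2", "untreated_1", "untreated_2")
rownames(counts) <- paste0("locus_", seq_len(n_loci))

# Group labels, with "untreated" as the reference level.
coldata <- data.frame(
  condition = factor(c("treated", "treated", "untreated", "untreated"),
                     levels = c("untreated", "treated")),
  row.names = colnames(counts)
)

# Hypothetical helper, not part of this PR.
run_diff_sketch <- function(counts, coldata, estimate_size_factors = FALSE) {
  dds <- DESeqDataSetFromMatrix(countData = counts,
                                colData = coldata,
                                design = ~ condition)
  if (estimate_size_factors) {
    # Normal DESeq2 behaviour: estimate size factors from the data.
    dds <- estimateSizeFactors(dds)
  } else {
    # Assumption: input is already scaled, so every sample gets factor 1.
    sizeFactors(dds) <- rep(1, ncol(dds))
  }
  # DESeq() keeps pre-existing size factors instead of re-estimating them.
  dds <- DESeq(dds, quiet = TRUE)
  results(dds)
}

res <- run_diff_sketch(counts, coldata)
res_df <- as.data.frame(res)

# Example cutoffs only; padj can be NA, so those rows are dropped explicitly.
hits <- res_df[!is.na(res_df$padj) &
                 res_df$padj < 0.05 &
                 abs(res_df$log2FoldChange) > 1, ]
```

The last step is the part that stays with the caller in any case: the results table carries `pvalue`, `padj` and `log2FoldChange` columns, and which thresholds make sense depends on the experiment.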