Compiled by Josh Peters, George Sun
Support by Prerna Bhargava, Kyle McLean, Divya Ramamoorthy, Tyler Toth, Alexander Triassi
Last Updated: 15 December 2018
This repository compiles basic R resources to build plots comparing distributions between one continous variable (e.g. protein concentration) across multiple discrete variables (e.g. samples). This situation is common in the life sciences due to the need to conduct replicates of experiments with multiple positive and negative controls. Additionally, these plots are common when one wants to drill-down to one gene or protein in -omics datasets. Examples include:
- Protein concentrations across samples from ELISAs
- Transcript level distributions across samples in bulk RNA-seq or cells in scRNA-seq
- Bacterial load in cells or tissue post-infection
Data with temporal aspects can be displayed with this plots, but the number of time points should be less than 3. Regardless, it is not recommended.
The files can be found within this repository.
This repository contains:
- Cheatsheet PDF for quick viewing
- Associated notebooks and raw data to generate plots within cheatsheets
- Practice data sets you can use to generate plots and play around with themes of your own. Both the data and original plot are included in the
/data
folder.- The included scripts use data from this predator-prey publication.
- Additional data sets are included from this amyloid-beta publication and this PD-1 publication.
We do not own this data. All credit goes to the authors for providing this data with publication. Usage of this data is at the discretion of the user.
Within the notebooks provided, the system and session info are detailed.
Check out our list of resources