Markdown notebooks illustrating the basics of R, RStudio and the tidyverse for medical research, epidemiology & related fields. The datasets used are either simulated or in the public domain.
Where to start: after installing R and RStudio, download these files (click the green "Code" button, then "Download ZIP"), unzip them, and open `IntRo.Rproj` and `01_basics.qmd`. If you're familiar with Git, you can simply clone the repo.
NB - I'm currently restructuring this.
- 01. The basics (RStudio interface; using R scripts and Quarto documents; assigning values (`<-`); using functions, packages and data frames)
- 02. Visualising (Making bar charts, line charts, box plots and scatterplots with `ggplot2`)
- 03. Importing data (importing and exporting data from/to file and from/to database)
- 04. Data summaries (Summarising continuous and categorical data with `dplyr` and `janitor`; the pipe `|>`; table 1 with `gtsummary`)
- 05. Data structures (to be created)
- 06. (currently: intro4) Data subsets (Filtering rows and selecting columns with `dplyr` and base R)
- 07. (currently: intro5) Data transformation (Importing and exporting data; calculate new variables, categorise continuous variables, regroup a categorical variable)
- 08. (currently: intro6) Data cleaning (Deleting and renaming variables; data classes; factors; labelling and recoding values)
- 09. (currently: intro7) Data reshaping (Joining datasets; wide and long formats; reshaping wide to long and vice versa)
- 10. SQL (to be created)
- 11. Git (to be created)
- 12. Markdown (to be created)
- 13. Workflows (to be created)
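
For a flavour of what the early notebooks cover, here is a minimal sketch in base R showing assignment with `<-`, a data frame, row and column subsetting, and the native pipe `|>`. The `patients` data frame and its values are made up for this illustration and are not taken from the notebooks; the native pipe assumes R 4.1 or later.

```r
# Simulated data, invented for this illustration only
patients <- data.frame(
  id  = 1:6,
  age = c(34, 51, 67, 45, 72, 29),
  sex = c("F", "M", "F", "F", "M", "M")
)

# Filter rows (age over 40) and select two columns with base R
over_40 <- patients[patients$age > 40, c("id", "age")]

# The native pipe passes the left-hand side as the first argument
mean_age <- patients$age |> mean()

# Count the levels of a categorical variable
sex_counts <- table(patients$sex)

print(over_40)
print(mean_age)
print(sex_counts)
```

The notebooks themselves do most of this with `dplyr` and related packages; base R is used here only so that the sketch runs without installing anything.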
Author: Andrea Mazzella