Extracting and tidying up data from graphs in the National Ambulance Syndromic Surveillance System (England)
I extracted Cardiac/Respiratory Arrest data from graphs in five UK Health Security Agency/Public Health England bulletins on the National Ambulance Syndromic Surveillance System (NASSS). The five publications were:
The graphs were copied from the bulletins, lightly edited in GIMP (https://www.gimp.org) to remove the gridlines and the daily data trace, and then uploaded to WebPlotDigitizer (https://automeris.io/WebPlotDigitizer/) to read off the data points.
This graph shows all the data from the four reports plotted together.
This graph shows which data from each report was used. It also shows interpolated points (in grey) used to fill the gaps between datapoints read using WebPlotDigitizer.
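The gap filling described above can be sketched as follows. This is only an illustration in Python, not the code used for this dataset: the text does not say which interpolation method was applied, so linear interpolation at a daily step is an assumption, and the function name fill_gaps and the example values are hypothetical.

```python
from datetime import date, timedelta


def fill_gaps(points, step_days=1):
    """Linearly interpolate between consecutive (date, value) points.

    Returns (date, value, interpolated) tuples, where interpolated is True
    for points that were generated rather than digitized.
    Assumption: linear interpolation at a fixed step of whole days.
    """
    points = sorted(points)
    filled = []
    for (d0, v0), (d1, v1) in zip(points, points[1:]):
        filled.append((d0, v0, False))
        span = (d1 - d0).days
        # Generate the missing days strictly between the two digitized points.
        for offset in range(step_days, span, step_days):
            value = v0 + (v1 - v0) * offset / span
            filled.append((d0 + timedelta(days=offset), value, True))
    if points:
        last_day, last_value = points[-1]
        filled.append((last_day, last_value, False))
    return filled


# Made-up example values, not taken from the bulletins.
digitized = [(date(2021, 1, 1), 100.0), (date(2021, 1, 4), 130.0)]
for row in fill_gaps(digitized):
    print(row)
```

Keeping the interpolated flag on every row makes it possible to draw the filled-in points in a different colour, as the graph above does in grey.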
The digitized data were saved as .csv files, then tidied in R (https://cran.r-project.org/) to remove duplicates and combine the datasets. Simple graphs were made using ggplot2 (https://ggplot2.tidyverse.org).
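The tidying itself was done in R. For readers who want to see the general idea, here is a minimal Python sketch of combining several digitized series and removing duplicate dates. The column names (date, value), the rule that a later report overwrites an earlier one on the same date, and the example values are all assumptions made for illustration; they are not taken from the files shared here.

```python
import csv
import io


def combine_reports(csv_texts):
    """Merge digitized series, keeping one value per date.

    csv_texts is ordered oldest report first. Assumed rule: when two
    reports cover the same date, the later report's value is kept.
    """
    merged = {}
    for report_index, text in enumerate(csv_texts):
        for row in csv.DictReader(io.StringIO(text)):
            merged[row["date"]] = {
                "date": row["date"],
                "value": float(row["value"]),
                "source_report": report_index,
            }
    # ISO dates (YYYY-MM-DD) sort correctly as plain strings.
    return [merged[day] for day in sorted(merged)]


# Made-up example data with one overlapping date.
report_a = "date,value\n2021-01-01,100\n2021-01-02,104\n"
report_b = "date,value\n2021-01-02,105\n2021-01-03,98\n"
for row in combine_reports([report_a, report_b]):
    print(row)
```

Recording which report each value came from is what allows a graph of the data used from each report to be drawn afterwards.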
The data and reports shared here are available under the Open Government Licence v3.0 (https://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/).
The graph style, with lines showing where each year starts and ends and a label for each year, was developed using example code from http://www.jasonfabris.com/blog/dteformat/.
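The graphs themselves were drawn with ggplot2. As a plotting-library-independent illustration of the year markers, the Python sketch below works out where the year boundary lines and year labels would go for a given date range. The function name year_guides and the choice to centre each label on the visible part of its year are assumptions, not a description of the linked example code.

```python
from datetime import date


def year_guides(first_day, last_day):
    """Return (lines, labels) for marking years on a date axis.

    lines  : every 1 January that falls inside the plotted range.
    labels : (year text, date) pairs, each placed midway through the
             part of that year that is actually plotted (assumed layout).
    """
    years = range(first_day.year, last_day.year + 1)
    lines = [date(y, 1, 1) for y in years
             if first_day <= date(y, 1, 1) <= last_day]
    labels = []
    for y in years:
        start = max(date(y, 1, 1), first_day)
        end = min(date(y, 12, 31), last_day)
        labels.append((str(y), start + (end - start) / 2))
    return lines, labels


# Made-up date range for illustration.
lines, labels = year_guides(date(2020, 3, 1), date(2021, 12, 31))
print(lines)
print(labels)
```

The returned dates can then be passed to whatever plotting tool is in use, for example as positions for vertical lines and text annotations.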