This project provides a comprehensive set of data visualizations for COVID-19 statistics in New York City. It includes time series analyses, geographic plots, and demographic breakdowns of case rates, death rates, and hospitalization rates.
- Time Series Visualizations
- Geographic Visualizations
- Demographic Visualizations
This project requires the following Python libraries:
- pandas
- matplotlib
- numpy
- geopandas
- geodatasets
You can install these dependencies using pip:
pip install -r requirements.txt
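To confirm that the installation worked before opening the notebooks, a small check along the following lines can help. This is a minimal sketch and not part of the project itself: the helper name `missing_packages` is hypothetical, and it assumes each package's import name matches its pip name, which holds for the five libraries listed above.

```python
from importlib.util import find_spec

# Package names taken from the dependency list above.
REQUIRED = ["pandas", "matplotlib", "numpy", "geopandas", "geodatasets"]


def missing_packages(names):
    """Return the names that cannot be found as importable top-level modules."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

If the script reports any missing packages, rerunning the pip command inside the same Python environment that Jupyter uses is usually the first thing to try.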
The data used in this project is sourced from the NYC Health Department's GitHub repository.
The project is divided into three main Jupyter notebooks:
- covid-time-series-plots.ipynb: Time series analysis of COVID-19 data
- covid-geographic-plots.ipynb: Geographic visualizations of COVID-19 case rates
- covid-bar-plots.ipynb: Demographic breakdowns and comparisons
To run the notebooks:
- Clone this repository
- Install the required dependencies
- Open the Jupyter notebooks and run the cells in order
Time Series Visualizations:
- Daily new cases, deaths, and hospitalizations over time
- Cumulative cases, deaths, and hospitalizations over time
- Interactive slider to select date ranges
Geographic Visualizations:
- Choropleth map of COVID-19 case rates by NYC borough
- Detailed map of COVID-19 case rates by ZIP code
Demographic Visualizations:
- Bar plots comparing case rates, hospitalization rates, and death rates across age groups
- Bar plots comparing COVID-19 statistics between genders
Future work:
- Perform statistical analysis (heatmaps, significance tests)
- Interpret the results
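One way the significance testing mentioned above might be approached is a Pearson chi-square test on a 2x2 table, for example two groups against case and non-case counts. The sketch below is only an illustration under stated assumptions: the function name `chi_square_2x2` is hypothetical, the counts are made up and are not NYC data, and it uses plain Python so it adds no dependency. It applies no continuity correction.

```python
def chi_square_2x2(table):
    """Pearson chi-square statistic for a 2x2 contingency table (no continuity correction)."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    return stat


# Critical value of the chi-square distribution with 1 degree of freedom at alpha = 0.05.
CRITICAL_VALUE_DF1_05 = 3.841

# Made-up counts for illustration only: rows are two groups, columns are (cases, non-cases).
example = [[30, 70], [50, 50]]
stat = chi_square_2x2(example)
significant = stat > CRITICAL_VALUE_DF1_05
```

A statistic above the critical value suggests the two groups differ by more than chance alone would explain at the 5% level. For tables larger than 2x2, or when an exact p-value is needed, a statistics library would be a better fit than this hand-rolled version.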
Contributions to this project are welcome. Please fork the repository and submit a pull request with your changes.
This project is licensed under the MIT License. See License for more details.
Note: This project uses data that is regularly updated. Make sure to pull the latest data from the NYC Health Department's GitHub repository for the most up-to-date visualizations.
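Because the source data changes over time, derived series such as cumulative totals need to be recomputed after each pull. The following is a minimal sketch of that step with pandas, not the notebooks' actual code: the column names `date` and `new_cases` and all the values are made up for illustration, and the real files may be structured differently.

```python
import pandas as pd

# Made-up daily counts for illustration; the real column names and values may differ.
daily = pd.DataFrame(
    {
        "date": pd.date_range("2020-03-01", periods=5, freq="D"),
        "new_cases": [1, 0, 3, 5, 2],
    }
).set_index("date")

# Cumulative totals are the running sum of the daily counts.
daily["cumulative_cases"] = daily["new_cases"].cumsum()

# Selecting a date range (what an interactive slider would do) is a label-based slice.
# Both endpoints are included when slicing a DatetimeIndex by label.
window = daily.loc["2020-03-02":"2020-03-04"]
```

Keeping the cumulative column derived from the daily column, rather than storing it separately, means a fresh pull of the data only requires rerunning this one step.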