- Build API test
- Remove repeated code DRY
- Build classes
- Data builder done
- Cleaning class needed
- plot classes needed
- Rework plots use plotly
- gender plots done
- use dash for the dashboard
- make new requirements.txt
- Read in two data sets.
- Make 3 plots
- Make a virtual environment for the project
- Annotate our code
I revisited this project recently to add API calls to everything and play with a few new plotting ideas.
The API calls are built into the data_builder.py
file. You will need a API key for the Cencus data. Other than that pull which data you need and proceed to the revisited.ipynb
file.
This project was made with Anaconda installed, using the below packages in VS Code using Jupyter notebooks. PIP install packages as needed. Alternatively a virtual environment is included.
Documentation for venv: (https://docs.python.org/3/tutorial/venv.html)
From the directory pip install the requirements.txt file by running "pip install -r requirements.txt"
Feature 2 - Manipulate and clean your data: In each data set I cleanded the sheets by removing missing data and dropping data that was not needed. I then joined the 2 data sets into one to work from.
Feature 3 - Visualize data - Various plots were made to understand that data and a final cleaned cheet was made to compare that data.
Feature 4 - Utilized a virtual environment and include instructions in the README on how the user should run.
Feature 5 - Interpreted the data by annotating the code via markdown cells.
- After you have cloned the repo to your machine, navigate to the project folder in GitBash/Terminal.
- Create a virtual environment in the project folder.
- Activate the virtual environment.
- Install the required packages.
- When you are done working on your repo, deactivate the virtual environment.
Command | Linux/Mac | GitBash |
---|---|---|
Create | python3 -m venv venv |
python -m venv venv |
Activate | source venv/bin/activate |
source venv/Scripts/activate |
Install | pip install -r requirements.txt |
pip install -r requirements.txt |
Deactivate | deactivate |
deactivate |
***final_code.ipynb no longer works as the data is now formatted differently from Louisville open data. I removed the data before realizing that. Outputs are saved for viewing.