Skip to content

Latest commit

 

History

History
83 lines (55 loc) · 2.78 KB

README.md

File metadata and controls

83 lines (55 loc) · 2.78 KB

Data Mining Project

We make SIR model with python from COVID-19 Italy dataset.

Installation

Use the package manager pip to install required packages.

pip install numpy
pip install matplotlib
pip install scipy
pip install beautifulsoup4
pip install requests

Python version used :

python --version
Python 3.6.8

Usage

Brief Summary

There are 3 main file in the root directory.

  1. Scrapper scrapper.py Is used to download automatically the data from the github. Here is the github link
  2. Cleaner cleaner.py Is used to clean the downloaded data and save the new data in dataset/dataframe.csv
  3. Runner runner.py Is used to run the SIR model and compare it to the dataset/dataframe.csv so that we can get the accuracy of the model.

How to run

First, thing first. Install all the requirement. And then, run the scrapper.py like this :

python scrapper.py

Wait untill all the data is downloaded into the ./dataset/ folder. After that, the second thing we must do is clean the data :

python cleaner.py

After the data has been cleaned. We can continue to run the runner file :

python runner.py

We can see in stdout about the details. And also, remember that the value of constant.TIME must be equal to the number of row in dataframe. Unless you dont want to plot the real data result.

Model Detail

There are two models that we provide :

  1. SIR (Susceptible, Infected, Recovered/Removed)
  2. SEIRD (Susceptible, Exposed, Infected, Recovered, Death)

You can find both model file python in ./model/ directory

Contributing

No contributor needed

References

License

MIT