Bundesterminator

What is it about?

The project explores ways to predict party affiliation from text segments. Both machine learning and deep learning approaches are tested. This is the result of a two-week project at the Le Wagon Data Science Bootcamp, Batch 606 Berlin.

A demo is available at http://bundesterminator.herokuapp.com/.

Data

The models were trained on the meeting minutes of the German Parliament. They are available as XML files from the open data website of the German Parliament. The XML files were pre-processed and converted into CSV files (at the time of the project, the Python library pandas had no XML import).
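
The conversion step could look roughly like the sketch below. It uses only the Python standard library. The element names (`rede`, `redner`, `fraktion`, `p`), the sample document, and the column names are assumptions for illustration and may not match the actual schema of the Bundestag files or the project's pre-processing code.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified stand-in for one plenary protocol.
SAMPLE_XML = """
<protokoll>
  <rede id="r1">
    <redner><fraktion>SPD</fraktion></redner>
    <p>Erster Absatz.</p>
    <p>Zweiter Absatz.</p>
  </rede>
  <rede id="r2">
    <redner><fraktion>CDU/CSU</fraktion></redner>
    <p>Noch ein Absatz.</p>
  </rede>
</protokoll>
"""


def speeches_to_rows(xml_text):
    """Return one dict per speech with its id, party, and joined paragraph text."""
    root = ET.fromstring(xml_text)
    rows = []
    for speech in root.iter("rede"):
        party = speech.findtext("redner/fraktion", default="")
        text = " ".join((p.text or "").strip() for p in speech.findall("p"))
        rows.append({"speech_id": speech.get("id"), "party": party, "text": text})
    return rows


def rows_to_csv(rows):
    """Serialize the rows to a CSV string with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["speech_id", "party", "text"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = speeches_to_rows(SAMPLE_XML)
csv_text = rows_to_csv(rows)
```

A CSV file written this way can then be loaded with `pandas.read_csv` for training.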

Folder Structure

api

The trained model can be exposed through a web API. It uses a lean setup based on FastAPI and Uvicorn. The deployment settings assume deployment on Heroku.

bundestag

The bundestag folder contains the bundestag Python package. It holds the main files for training the models.

trainer.py

Pipeline for a machine learning approach.
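
As an illustration of what such a pipeline can look like, the sketch below chains a TF-IDF vectorizer and a Naive Bayes classifier with scikit-learn. The choice of these two steps, the placeholder labels, and the toy sentences are assumptions; this README does not state which estimators `trainer.py` uses.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

# Toy training data (made up for illustration, not taken from the minutes).
texts = [
    "steuern senken wirtschaft stärken",
    "wirtschaft entlasten steuern senken",
    "klima schützen kohle beenden",
    "klima retten energie erneuern",
]
labels = ["A", "A", "B", "B"]  # placeholder party labels

# Step 1 turns raw text into TF-IDF features, step 2 classifies them.
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("clf", MultinomialNB()),
])
pipeline.fit(texts, labels)

prediction = pipeline.predict(["steuern senken"])[0]
```

Wrapping both steps in one `Pipeline` object means the same vectorizer vocabulary is applied at training and prediction time.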

bundestrainer.py

Class that wraps the functionality for training a deep learning model with TensorFlow Keras and a trained Gensim word2vec model.

bundes_w2v.py

Light wrapper around the Gensim word2vec module.

data.py

Helper function to acquire the data.

utils.py

Helper function to pre-process the data.
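
Typical pre-processing for this kind of text classification might look as follows. The specific steps (lowercasing, removing digits and punctuation, collapsing whitespace) and the function name `clean_text` are assumptions, not a description of what `utils.py` does.

```python
import re
import string

# ASCII punctuation plus typographic quotes common in German transcripts
PUNCTUATION = string.punctuation + "„“”"
_PUNCT_TABLE = str.maketrans("", "", PUNCTUATION)


def clean_text(text):
    """Lowercase, drop digits and punctuation, and collapse whitespace."""
    text = text.lower()
    text = re.sub(r"\d+", "", text)
    text = text.translate(_PUNCT_TABLE)
    return " ".join(text.split())


cleaned = clean_text("Herr Präsident! Im Jahr 2020 haben wir „viel“ erreicht.")
```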

Deployment

Other files are added to enable deployment of the API to Heroku and to have an automated workflow based on GitHub Actions.

Please note that you need to set environment variables to deploy on Google Cloud Platform. Currently this has to be done directly in data.py, trainer.py and bundestrainer.py. The environment variables used by the Makefile need to be set as well. This will be replaced by a more flexible approach in the future.
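
One possible shape for a more flexible approach is to read the settings from the environment in one place instead of editing each module. The variable names (`GCP_PROJECT_ID`, `GCP_BUCKET_NAME`) and the `load_settings` helper are hypothetical and not part of the current code.

```python
import os


def load_settings(environ=None):
    """Collect GCP-related settings from environment variables.

    Raises a RuntimeError naming every missing variable, so that a
    misconfigured deployment fails early.
    """
    environ = os.environ if environ is None else environ
    required = ["GCP_PROJECT_ID", "GCP_BUCKET_NAME"]  # hypothetical names
    missing = [name for name in required if not environ.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: environ[name] for name in required}


# Example with an explicit mapping instead of the real environment
settings = load_settings({"GCP_PROJECT_ID": "my-project", "GCP_BUCKET_NAME": "my-bucket"})
```

Each module could then call `load_settings()` without arguments and pick up the values from the shell, the Makefile, or the hosting platform's configuration.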

Licence

MIT

Team

The work is a collaborative effort of the following team members, who each contributed to the project:

Thanks

We cannot thank the AMAAAAAZIIIING team of Le Wagon enough. Their patience, expertise, and dedication opened a new world for us.
