Name		Name	Last commit message	Last commit date
parent directory ..
01-intro		01-intro
02-regression		02-regression
03-classification		03-classification
04-evaluation		04-evaluation
05-deployment		05-deployment
06-trees		06-trees
07-bentoml-production		07-bentoml-production
08-deep-learning		08-deep-learning
09-serverless		09-serverless
10-kubernetes		10-kubernetes
11-kserve		11-kserve
article		article
cohorts		cohorts
projects		projects
README.md		README.md
asking-questions.md		asking-questions.md
generate-description.ipynb		generate-description.ipynb
generate-pages.ipynb		generate-pages.ipynb

README.md

Machine Learning Zoomcamp

Register at DataTalks.Club and join the #course-ml-zoomcamp channel
Course telegram channel
Course playlist
For the 2022 edition, see more info in the 2022 Cohort section

Syllabus

Introduction to Machine Learning
Machine Learning for Regression
Machine Learning for Classification
Evaluation Metrics for Classification
Deploying Machine Learning Models
Decision Trees and Ensemble Learning
Neural Networks and Deep Learning
Serverless Deep Learning
Kubernetes and TensorFlow Serving

Taking the course

2022 Cohort

We start the course again in September 2022

Sign up here
Register at DataTalks.Club and join the #course-ml-zoomcamp channel
Join the course telegram channel
Subscribe to the public google calendar (subscribing works from desktop only)
Tweet about it
Start date: September 5
If you have questions, check FAQ
All the materials specific to the 2022 will be in the 2022 cohort folder

Self-paced mode

You can take the course at your own pace. All the materials are freely available, and you can start learning at any time.

To take the best out of this course, we recommened this:

Register at DataTalks.Club and join the #course-ml-zoomcamp channel
For each module, watch the videos and work through the code
If you have any questions, ask them in the #course-ml-zoomcamp channel in Slack
Do homework. There are solutions, but we advise to first attempt the homework yourself, and after that check the solutions
Do at least one project. Two is better. Only this way you can make sure you're really learning. If you need feedback, use the #course-ml-zoomcamp channel

Of course, you can take each module independently.

Prerequisites

Prior programming experience (at least 1+ year)
Being comfortable with command line
No prior exposure to machine learning is required

Nice to have but not mandatory

Python (but you can learn it during the course)
Prior exposure to linear algebra will be helpful (e.g. you studied it in college but forgot)

Asking questions

The best way to get support is to use DataTalks.Club's Slack. Join the #course-ml-zoomcamp channel.

To make discussions in Slack more organized:

Follow these recommendations when asking for help
Read the DataTalks.Club community guidelines

1. Introduction to Machine Learning

2. Machine Learning for Regression

2.1 Car price prediction project
2.2 Data preparation
2.3 Exploratory data analysis
2.4 Setting up the validation framework
2.5 Linear regression
2.6 Linear regression: vector form
2.7 Training linear regression: Normal equation
2.8 Baseline model for car price prediction project
2.9 Root mean squared error
2.10 Using RMSE on validation data
2.11 Feature engineering
2.12 Categorical variables
2.13 Regularization
2.14 Tuning the model
2.15 Using the model
2.16 Car price prediction project summary
2.17 Explore more
2.18 Homework

3. Machine Learning for Classification

3.1 Churn prediction project
3.2 Data preparation
3.3 Setting up the validation framework
3.4 EDA
3.5 Feature importance: Churn rate and risk ratio
3.6 Feature importance: Mutual information
3.7 Feature importance: Correlation
3.8 One-hot encoding
3.9 Logistic regression
3.10 Training logistic regression with Scikit-Learn
3.11 Model interpretation
3.12 Using the model
3.13 Summary
3.14 Explore more
3.15 Homework

4. Evaluation Metrics for Classification

4.1 Evaluation metrics: session overview
4.2 Accuracy and dummy model
4.3 Confusion table
4.4 Precision and Recall
4.5 ROC Curves
4.6 ROC AUC
4.7 Cross-Validation
4.8 Summary
4.9 Explore more
4.10 Homework

5. Deploying Machine Learning Models

5.1 Intro / Session overview
5.2 Saving and loading the model
5.3 Web services: introduction to Flask
5.4 Serving the churn model with Flask
5.5 Python virtual environment: Pipenv
5.6 Environment management: Docker
5.7 Deployment to the cloud: AWS Elastic Beanstalk (optional)
5.8 Summary
5.9 Explore more
5.10 Homework

6. Decision Trees and Ensemble Learning

7. Production-Ready Machine Learning (Bento ML)

7.1 Intro/Session Overview
7.2 Building Your Prediction Service with BentoML
7.3 Deploying Your Prediction Service
7.4 Sending, Receiving and Validating Data
7.5 High-Performance Serving
7.6 Bento Production Deployment
7.7 (Optional) Advanced Example: Deploying Stable Diffusion Model
7.8 Summary
7.9 Homework

Midterm Project

Putting everything we've learned so far in practice!

8. Neural Networks and Deep Learning

8.1 Fashion classification
8.2 TensorFlow and Keras
8.3 Pre-trained convolutional neural networks
8.4 Convolutional neural networks
8.5 Transfer learning
8.6 Adjusting the learning rate
8.7 Checkpointing
8.8 Adding more layers
8.9 Regularization and dropout
8.10 Data augmentation
8.11 Training a larger model
8.12 Using the model
8.13 Summary
8.14 Explore more
8.15 Homework

9. Serverless Deep Learning

9.1 Introduction to Serverless
9.2 AWS Lambda
9.3 TensorFlow Lite
9.4 Preparing the code for Lambda
9.5 Preparing a Docker image
9.6 Creating the lambda function
9.7 API Gateway: exposing the lambda function
9.8 Summary
9.9 Explore more
9.10 Homework

10. Kubernetes and TensorFlow Serving

10.1 Overview
10.2 TensorFlow Serving
10.3 Creating a pre-processing service
10.4 Running everything locally with Docker-compose
10.5 Introduction to Kubernetes
10.6 Deploying a simple service to Kubernetes
10.7 Deploying TensorFlow models to Kubernetes
10.8 Deploying to EKS
10.9 Summary
10.10 Explore more
10.11 Homework

11. KServe (optional)

11.1 Overview
11.2 Running KServe locally
11.3 Deploying a Scikit-Learn model with KServe
11.4 Deploying custom Scikit-Learn images with KServe
11.5 Serving TensorFlow models with KServe
11.6 KServe transformers
11.7 Deploying with KServe and EKS
11.8 Summary
11.9 Explore more

Capstone Project 1

Putting everything we've learned so far in practice one more time!

Article

Writing an article about something not covered in the course.

Capstone project 2 (optional)

For those who love projects!

Previous cohorts

2021 Cohort

Our other courses

If you liked this course, you'll like other courses from us:

Supporters and partners

Thanks to the course sponsors for making it possible to run this course

Thanks to our friends for spreading the word about the course