Authors: Ido Michael and Eduardo Blancas
This workshop demonstrates how to develop reproducible pipelines using Ploomber.
To start, click here or on the button below:
Note: It may take a few seconds for the notebook to load.
Scroll down to the Running it locally section if you prefer to run things locally.
Familiarity with JupyterLab, and a basic knowledge of pandas and scikit-learn.
- Introduction
- Refactoring a legacy notebook
- The
pipeline.yaml
file. - Building the pipeline
- Declaring dependencies
- Adding a new task
- Incremental builds
- Execution in the cloud
You can also follow this workshop locally, but it requires a bit more setup:
Pre-requisites:
- miniconda
git
# clone the repository
git clone https://github.com/idomic/ploomber-workshop
cd ploomber-workshop
# install dependencies (requires conda)
pip install invoke
invoke setup --from-lock
# activate environment
conda activate ploomber-workshop
# start jupyter
jupyter lab
Then open index.ipynb
.
# install dependencies
pip install --upgrade pip
pip install -r requirements.dev.txt
# start jupyter
jupyter lab
Then open index.ipynb
.
If you like our project, please give us a ⭐️ on GitHub.