Skip to content

tinztwins/mlflow-workspace

Repository files navigation

MLflow Workspace

Newsletter

👉🏽 Do you enjoy our content and want to read super-detailed articles about AI? If so, subscribe to our Tinz Twins Hub and get our popular data science cheat sheets for FREE.

General

The MLflow environment serves as a tracking server for managing the machine learning lifecycle. The repo contains several infrastructure services that start in a Docker stack.

Infrastructure services

The Docker stack uses four infrastructure services. These services can be started manually so that you get an MLflow working environment in Docker.

  • JupyterLab (docker-container: jupyter)
  • MLflow Tracking Server (docker-container: mlflow)
  • Artifact Store as SFTP Server (docker-container: sftp)
  • Backend Storage as postgres db (docker-container: postgres)

Start and stop the MLflow workspace

  1. Start MLflow workspace: sh start_docker_stack.sh
  2. Visit http://127.0.0.1:5001 for MLflow UI
  3. Visit http://127.0.0.1:8888 for JupyterLab UI (enter the password mlflow)
  4. Stop MLflow workspace: sh stop_docker_stack.sh

Test your MLflow workspace setup

Adding new dependencies for JupyterLab

Apple silicon or arm64 architecture

  • Please open the .env file and change the variable SFTP_IMAGE to tinztwins/sftp-server:arm64.

Troubleshooting

  • On Linux systems there may be permission problems with the Artifact Store. The current workaround is to set the permissions manually (sudo chmod -R 777).

Useful Tools

References