Name		Name	Last commit message	Last commit date
parent directory ..
ks_app		ks_app
serving/seldon-wrapper		serving/seldon-wrapper
training/ddp/mnist		training/ddp/mnist
web-ui		web-ui
01_setup_a_kubeflow_cluster.md		01_setup_a_kubeflow_cluster.md
02_distributed_training.md		02_distributed_training.md
03_serving_the_model.md		03_serving_the_model.md
04_querying_the_model.md		04_querying_the_model.md
05_teardown.md		05_teardown.md
OWNERS		OWNERS
README.md		README.md

README.md

End-to-End kubeflow tutorial using a Pytorch model in Google Cloud

This example demonstrates how you can use kubeflow end-to-end to train and serve a distributed Pytorch model on a kubernetes cluster in GCP. This tutorial is based upon the below projects:

Goals

There are two primary goals for this tutorial:

Demonstrate an End-to-End kubeflow example
Present an End-to-End Pytorch model

By the end of this tutorial, you should learn how to:

Setup a Kubeflow cluster on a new Kubernetes deployment
Spawn up a shared-persistent storage across the cluster to store models
Train a distributed model using Pytorch and GPUs on the cluster
Serve the model using Seldon Core
Query the model from a simple front-end application

The model and the data

This tutorial trains a TensorFlow model on the MNIST dataset, which is the hello world for machine learning.

The MNIST dataset contains a large number of images of hand-written digits in the range 0 to 9, as well as the labels identifying the digit in each image.

After training, the model classifies incoming images into 10 categories (0 to 9) based on what it’s learned about handwritten images. In other words, you send an image to the model, and the model does its best to identify the digit shown in the image.

In the above screenshot, the image shows a hand-written 7. The table below the image shows a bar graph for each classification label from 0 to 9. Each bar represents the probability that the image matches the respective label. Looks like it’s pretty confident this one is an 7!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pytorch_mnist

pytorch_mnist

README.md

End-to-End kubeflow tutorial using a Pytorch model in Google Cloud

Goals

The model and the data

Steps:

Files

pytorch_mnist

Directory actions

More options

Directory actions

More options

Latest commit

History

pytorch_mnist

Folders and files

parent directory

README.md

End-to-End kubeflow tutorial using a Pytorch model in Google Cloud

Goals

The model and the data

Steps: