[Sample] Add new TFX::OSS sample #2319

numerology · 2019-10-07T17:03:32Z

Add a sample showing how to run a TFX pipeline on the Kubeflow runner.

TODO:

- Add proper dependencies so it can be built into the preloaded samples
- Add docs
- Add sample test coverage

numerology · 2019-10-07T20:24:33Z

Note that compiling the new sample requires a HEAD version of TFX (as of 10.07.2019) and a customized value in tfx.version pointing to the TFX nightly build, because there is no image tagged tfx:0.15.0.dev. I suggest we move this sample to the contrib samples directory to bypass the integration test, and check the compiled pipeline package into our repo so that the backend server can preload it. Otherwise we might need a rather complex dependency setup, which is also error-prone because it relies on a dev version of TFX.
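Before checking a compiled package into the repo, it may help to confirm that the archive actually carries a workflow definition. A minimal sketch, assuming the package is a gzip-compressed tarball with a YAML workflow inside; the helper name `find_workflow_yaml` is hypothetical and not part of the KFP SDK:

```python
import tarfile


def find_workflow_yaml(package_path):
    """Return the names of YAML files found inside a compiled pipeline tarball.

    Hypothetical helper: it only inspects the archive listing and does not
    validate the workflow contents.
    """
    with tarfile.open(package_path, "r:gz") as tar:
        return [
            member.name
            for member in tar.getmembers()
            if member.isfile() and member.name.endswith((".yaml", ".yml"))
        ]
```

An empty result would suggest the archive is not a usable pipeline package and should not be preloaded.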

I will add a doc so that users know how to compile this hacky sample successfully.

WDYT @neuromage @gaoning777

neuromage · 2019-10-07T23:53:11Z

> WDYT @neuromage @gaoning777

SGTM, thanks!

gaoning777 · 2019-10-08T12:44:35Z

I think it is a little risky to load the sample in the API server without testing. The preloaded samples are supposed to be flagship samples that are guaranteed to work. Say a new user tries out KFP and runs the preloaded sample, and it fails. That could easily scare them away. Besides, we got lots of issues in the early stages when the preloaded samples broke. @SinaChavoshi used to use the old TFX samples for demos and, IIRC, a sample breaking during a demo is bad.

My suggestion is a small change on top of your current work: write some custom sample test infra code to cover the compiled pipeline. It shouldn't take much custom code, and it is rewarding. WDYT? @numerology @neuromage

numerology · 2019-10-08T15:30:13Z

> I think it is a little risky to load the sample in the API server without testing. The preloaded samples are supposed to be flagship samples that are guaranteed to work. Say a new user tries out KFP and runs the preloaded sample, and it fails. That could easily scare them away. Besides, we got lots of issues in the early stages when the preloaded samples broke. @SinaChavoshi used to use the old TFX samples for demos and, IIRC, a sample breaking during a demo is bad.
>
> My suggestion is a small change on top of your current work: write some custom sample test infra code to cover the compiled pipeline. It shouldn't take much custom code, and it is rewarding. WDYT? @numerology @neuromage

One current problem is that we need a dev version and a nightly build of TFX to make this compile and run. If we put those dependencies in our sample test infra, it might not be very stable. I talked to the TFX side and was told the 0.15.0 release should come either this week or next week. I think it will be much easier and more reliable to add sample test coverage then.

gaoning777 · 2019-10-09T09:07:26Z

What I meant is not adding a dependency on the dev version of TFX, but adding custom sample test infra code to run chicago_taxi_pipeline_simple.tar.gz directly, without compilation. We add both the Python code and the compiled package, as you have done here.
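A rough sketch of that idea: submit the checked-in package as-is, so the test never imports TFX or invokes the compiler. This assumes a KFP SDK version whose `kfp.Client` exposes `create_run_from_pipeline_package`; the function name `run_precompiled_package` and the default experiment name are hypothetical, not the test code added in this PR.

```python
def run_precompiled_package(client, package_path, arguments=None,
                            experiment_name="tfx-sample-test"):
    """Submit an already-compiled pipeline package without recompiling it.

    `client` is expected to behave like kfp.Client (assumption). The
    experiment name default is a placeholder chosen for illustration.
    """
    return client.create_run_from_pipeline_package(
        pipeline_file=package_path,
        arguments=arguments or {},
        experiment_name=experiment_name,
    )


# Example usage against a real cluster (requires the kfp package):
#   import kfp
#   run_precompiled_package(kfp.Client(), "chicago_taxi_pipeline_simple.tar.gz")
```

Passing the client in as a parameter keeps the helper free of any import-time dependency on TFX or the compiler, which is the point of the suggestion.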

numerology · 2019-10-09T16:10:06Z

> What I meant is not adding a dependency on the dev version of TFX, but adding custom sample test infra code to run chicago_taxi_pipeline_simple.tar.gz directly, without compilation. We add both the Python code and the compiled package, as you have done here.

I see. That's doable. Let me ping you when it's done.

numerology · 2019-10-09T23:47:46Z

PTAL @neuromage @gaoning777 Thanks!

neuromage · 2019-10-10T00:01:24Z

This is pretty awesome, thanks @numerology :-)
/lgtm
/approve

k8s-ci-robot · 2019-10-10T00:01:35Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: neuromage

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~backend/OWNERS~~ [neuromage]
~~samples/OWNERS~~ [neuromage]
~~test/OWNERS~~ [neuromage]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

gaoning777 · 2019-10-10T05:03:59Z

This is great.
Thanks

animeshsingh · 2019-10-19T03:35:52Z

@gaoning777 @numerology @neuromage we have a talk at the KF summit and TF World on this topic - are there any additional materials (slides, design docs, etc.) on TFX that you can point me to?

numerology · 2019-10-21T18:05:52Z

> @gaoning777 @numerology @neuromage we have a talk at the KF summit and TF World on this topic - are there any additional materials (slides, design docs, etc.) on TFX that you can point me to?

Besides the code repo, the recommended entry point to the docs is the user guide, which introduces some basic concepts and the orchestration mechanism of TFX. There are also some recorded talks and tutorials here.

* Upgrade dockerfiles python version to 3.9
* Upgrade the following dockerfiles to Python 3.9: Aiffairness, Aixexplainer, Alibiexplainer, Artexplainer, custom model, custom transformer, lgb, paddle, pmml, sklearn, xgb, storage-initializer. Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>
* Updated Python dependencies for explainers. Explainer image builds were failing after updating Python to a newer version. Updated the dependencies to ones compatible with the newer base image. Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>
* Optimize docker image builds (kubeflow#2319)
* Fix custom model dockerfile
* Fix python dependency installation failure. Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>
* Optimize dockerfiles
* Optimize all dockerfiles for layer caching. Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>
* Fixed issue with a missing library in paddle. The Paddle docker image has missing libs after the base image and version changes. Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>
* Fixed failing e2e tests. Added a library explicitly in setup.py; reverted the Python base image for the AI explainer. Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>

numerology added 5 commits October 1, 2019 17:53

init.

a0aad12

Patched Ajay's sample

54213af

Clean up the sample and add preload config.

5ca2ab2

Fix default value

d72dcb4

Remove old file

552428e

k8s-ci-robot added the do-not-merge/work-in-progress label Oct 7, 2019

k8s-ci-robot requested review from hongye-sun and neuromage October 7, 2019 17:03

k8s-ci-robot added the size/L label Oct 7, 2019

numerology added 2 commits October 7, 2019 10:26

Add compiled tfx sample

0be7fbf

Add compiled pipeline and move tfx sample to contrib to prevent dependency issue.

73f34c6

numerology assigned neuromage and gaoning777 Oct 7, 2019

numerology changed the title ~~[WIP] [Sample] Add new TFX::OSS sample~~ [Sample] Add new TFX::OSS sample Oct 7, 2019

k8s-ci-robot removed the do-not-merge/work-in-progress label Oct 7, 2019

numerology added 2 commits October 7, 2019 17:29

Add readme and remove redundant params

1788643

Add inline comments.

b98b739

numerology mentioned this pull request Oct 8, 2019

Add a simple TFX Taxi sample. #2312

Closed

Add description

04048d1

numerology mentioned this pull request Oct 8, 2019

[MKP/doc] Guideline needed for post-deployment setting about service account #2330

Closed

numerology added 4 commits October 9, 2019 11:46

Add sample test

883ec88

Merge branch 'master' of https://github.com/kubeflow/pipelines into add-hacky-tfx-preload-sample # Conflicts: # test/sample_test.yaml

922b85a

fix test name.

815dbd8

fix test dir

10bf241

numerology added 2 commits October 9, 2019 14:56

fix data path.

82a143e

Fix pipeline_root

1302e49

k8s-ci-robot added the lgtm label Oct 10, 2019

k8s-ci-robot added the approved label Oct 10, 2019

k8s-ci-robot merged commit 361fbee into kubeflow:master Oct 10, 2019

numerology deleted the add-hacky-tfx-preload-sample branch October 10, 2019 00:10

gaoning777 mentioned this pull request Oct 10, 2019

Clean duplicate/outdated TFX components in favor of the TFX OSS components #1903

Closed
