Azureml train pipeline #186

annazietlow · 2020-02-07T17:05:57Z

[Still requires documentation about running with dutchf3 data specifically.]

This PR adds configurable Azure ML training pipelines to Seismic DeepLearning. Each pipeline is configured via the pipeline json and environment variables. See the README added in this PR (azureml_pipelines folder) for more information.

A few arguments had to be added to train.py: input and output. These allow the pipeline to pass in mounted Blob storage locations to be used as a regular file system.
azureml_requirements.txt file was added to the training directory. This file holds all dependencies for train.py so they can be installed on the compute in Azure ML.
An empty init.py was added to the segmentation/dutchf3 module. I was unable to install cv_lib and access cv_lib.segmentation.dutchf3.utils without this when installing from github (note: to get this version to work, I am referencing this feature branch in the requirements. It should be changed to microsoft:staging before merging)
There are two scripts added: kickoff_train_pipeline.py & cancel_run.py. Kickoff_train shows how to run an Azure ML train pipeline & cancel_run is a development script for cancelling the run instead of letting it take resources to completion.
The train_pipeline inherits from base_pipeline which would be a helpful abstraction for future addition of an inference pipeline
Integration tests for kicking off a train pipeline are added. These are not currently run in a build pipeline because they require communication with an Azure ML Service instance
Requirements for kicking off the pipeline are added to interpretation's requirements.txt

To Test:

Prepare dutchf3 data and upload to blob storage
Create an instance of Azure ML service
Follow instructions for setting up the environment variables and kicking off a pipeline in the pipelines README
Verify that training runs successfully (currently taking ~2 hours for 1 epoch)

…ine config

experiments/interpretation/dutchf3_patch/local/train.py

olgaliak · 2020-02-08T01:15:46Z

interpretation/deepseismic_interpretation/azureml_pipelines/README.md

+        "input_datareference_name": "normalized_data_conditioned",
+        "input_dataset_name": "normalizeddataconditioned",
+        "source_directory": "train/",
+        "arguments": ["--splits", "splits",


should the example mention how the input\output params could be be passed in?

It adds more detail to the params around line 111. Do you mean more than that?

…seismic-deeplearning into azureml-train-pipelin

maxkazmsft · 2020-02-10T17:35:01Z

experiments/interpretation/dutchf3_patch/local/azureml_requirements.txt

@@ -0,0 +1,13 @@
+git+https://github.com/olgaliak/seismic-deeplearning.git@azureml-train-pipeline#egg=cv_lib&subdirectory=cv_lib


please pull from /microsoft. Also is there a way to specify master branch here? We're on staging by default which might break one of these days

Unfortunately doing that would require the init.py file that I added so this wouldn't work until that file is added to /microsoft and /master (there is a way to specify the master branch). I have a note about changing it to microsoft/staging before the merge in the PR comment. Just leaving for now so it can be tested

maxkazmsft · 2020-02-10T17:35:31Z

experiments/interpretation/dutchf3_patch/local/azureml_requirements.txt

@@ -0,0 +1,13 @@
+git+https://github.com/olgaliak/seismic-deeplearning.git@azureml-train-pipeline#egg=cv_lib&subdirectory=cv_lib
+git+https://github.com/microsoft/seismic-deeplearning.git#egg=deepseismic-interpretation&subdirectory=interpretation
+opencv-python==4.1.2.30


Is it possible to use Pillow for your work instead of openCV?

I think we are only using OpenCV for the boarder_contant feature to pad images. This is in the train.py that we are leveraging and in other areas of the code. Are you recommending to substitute this for similar functionality in Pillow?

maxkazmsft · 2020-02-10T17:37:54Z

interpretation/deepseismic_interpretation/azureml_pipelines/README.md

@@ -0,0 +1,175 @@
+# Integrating with AzureML


Could you please also add some verbiage in the main README and point to this file? AML team would love this!

maxkazmsft · 2020-02-10T17:39:24Z

interpretation/deepseismic_interpretation/azureml_pipelines/README.md

+AML_COMPUTE_CLUSTER_MIN_NODES
+AML_COMPUTE_CLUSTER_MAX_NODES
+AML_COMPUTE_CLUSTER_SKU
+```


Please use https://pypi.org/project/python-dotenv/ in the code and then ask the user to create a .env file where these will reside. We don't want to set these in the notebook

This does use python-dotenv to grab the variables. This is just the readme instructing them to set those variables in any way they choose (.env file for vscode is mentioned)

@annazietlow The DeepSeismic team has requested the PR to go to a contrib branch so I've closed this one and we are continuing the conversations on that PR #195

maxkazmsft · 2020-02-10T17:41:04Z

interpretation/deepseismic_interpretation/azureml_pipelines/README.md

+AML_COMPUTE_CLUSTER_SKU
+```
+On Windows you can use:
+`set VARIABLE=value`


We don't support Windows :-p please feel free to get rid of this.

annazietlow added 9 commits February 5, 2020 15:22

initial add of azure ml pipeline

897c4bd

update references and dependencies

197b599

fix integration tests

b7e62fc

remove incomplete tests

cddb065

add azureml requirements.txt for dutchf3 local patch and update pipel…

71fb2b2

…ine config

add empty __init__.py to cv_lib dutchf3

e2d8edb

Get train,py to run in pipeline

36d0a89

allow output dir in train.py

2d118fc

Clean up README and __init__

4bfdf51

annazietlow requested review from maxkazmsft, MJZawacki, olgaliak and kirasoderstrom February 7, 2020 19:33

Merge branch 'staging' into azureml-train-pipeline

5d188c5

olgaliak suggested changes Feb 8, 2020

View reviewed changes

annazietlow and others added 4 commits February 10, 2020 06:00

only pass output if available and use input dir for output in train.py

6855b40

Merge branch 'azureml-train-pipeline' of https://github.com/olgaliak/…

705a86d

…seismic-deeplearning into azureml-train-pipelin

update comment in train.py

8e0c2d7

Merge branch 'staging' into azureml-train-pipeline

2a462c0

maxkazmsft suggested changes Feb 10, 2020

View reviewed changes

maxkazmsft assigned maxkazmsft and yalaudah Feb 11, 2020

kirasoderstrom closed this Feb 11, 2020

kirasoderstrom mentioned this pull request Feb 11, 2020

Azureml train pipeline #195

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Azureml train pipeline #186

Azureml train pipeline #186

annazietlow commented Feb 7, 2020 •

edited

Loading

olgaliak Feb 8, 2020

annazietlow Feb 10, 2020

maxkazmsft Feb 10, 2020

annazietlow Feb 11, 2020

maxkazmsft Feb 10, 2020

kirasoderstrom Feb 11, 2020

maxkazmsft Feb 10, 2020

maxkazmsft Feb 10, 2020

annazietlow Feb 11, 2020

kirasoderstrom Feb 11, 2020

maxkazmsft Feb 10, 2020

		@@ -0,0 +1,13 @@
		git+https://github.com/olgaliak/seismic-deeplearning.git@azureml-train-pipeline#egg=cv_lib&subdirectory=cv_lib

Azureml train pipeline #186

Azureml train pipeline #186

Conversation

annazietlow commented Feb 7, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

annazietlow commented Feb 7, 2020 •

edited

Loading