support train ML model in either sync or async way #124

ylwu-amzn · 2022-01-19T10:01:56Z

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

Description

Support training an ML model either synchronously or asynchronously.
For example, a PPL user should be able to train a model synchronously, while a user can choose asynchronous training when the training is expected to be time-consuming.

Main changes:

- Support an async URL parameter in the train API. Add ?async=true to train the model asynchronously; by default, the model is trained synchronously.
- When training synchronously, the task is not persisted in the index.
- A sync request returns both the model id and the task id. An async request returns only the task id, and the user needs to poll the task to learn its state and progress.
- Use GetRequest to get the model in the predict task runner.
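
To illustrate the response contract described above, here is a minimal sketch, not the plugin's actual code. The class name `TrainResponseSketch`, the helper methods, and the `task_id` / `model_id` field names are all assumptions made for illustration; only the `async` URL parameter and the sync/async behavior come from this description.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.UUID;

// Hypothetical sketch: none of these names are taken from the plugin's code.
public class TrainResponseSketch {

    // Read the `async` URL parameter; a missing parameter means sync training.
    static boolean isAsync(Map<String, String> urlParams) {
        return Boolean.parseBoolean(urlParams.getOrDefault("async", "false"));
    }

    // Build the response fields: a task id is always returned, and a model id
    // is returned only in sync mode, where training completes before the response.
    static Map<String, String> train(Map<String, String> urlParams) {
        Map<String, String> response = new LinkedHashMap<>();
        response.put("task_id", UUID.randomUUID().toString()); // assumed field name
        if (!isAsync(urlParams)) {
            // Placeholder for the real training step.
            response.put("model_id", UUID.randomUUID().toString()); // assumed field name
        }
        return response;
    }

    public static void main(String[] args) {
        System.out.println("sync fields:  " + train(Map.of()).keySet());
        System.out.println("async fields: " + train(Map.of("async", "true")).keySet());
    }
}
```

In the async case the caller would then poll the task by its id until it reaches a terminal state.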

Check List

New functionality includes testing.
- All tests pass
New functionality has been documented.
- New functionality has javadoc added
Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.


Zhangxunmt · 2022-01-20T19:55:55Z

plugin/src/main/java/org/opensearch/ml/model/MLTask.java

@@ -63,6 +65,7 @@
    @Setter
    private String error;
    private User user; // TODO: support document level access control later
+    private boolean async;

    @Builder
    public MLTask(


We can consider moving this class to the ml-common in the next PR.

Yes, I will send out a separate refactoring PR.

Zhangxunmt · 2022-01-20T20:11:54Z

plugin/src/main/java/org/opensearch/ml/task/MLTrainingTaskRunner.java

+        if (request.isAsync()) {
+            mlTaskManager.createMLTask(mlTask, ActionListener.wrap(r -> {
+                String taskId = r.getId();
+                mlTask.setTaskId(taskId);
+                if (mlTask.isAsync()) {


Is there any use case where the request is sync but the MLTask is async? I am just wondering about the difference between the two.

The async property of MLTask is consistent with the request; see line 116 of this class.
For an async task, we cache the task in memory and persist it in the index, and the task id is returned directly.
For a sync task, we only cache the task in memory, then train the model and return the model id.
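
The difference in task storage can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the code in MLTrainingTaskRunner: the class `TaskStorageSketch`, the two maps standing in for the in-memory cache and the task index, the state strings, and the id formats are all hypothetical, and background training for the async path is omitted.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch: plain maps stand in for the task cache and the task index.
public class TaskStorageSketch {
    final Map<String, String> taskCache = new ConcurrentHashMap<>(); // in-memory cache
    final Map<String, String> taskIndex = new ConcurrentHashMap<>(); // stand-in for the index
    private int counter = 0;

    // Returns what the caller gets back: a task id (async) or a model id (sync).
    String run(boolean async) {
        String taskId = "task-" + (++counter);
        taskCache.put(taskId, "RUNNING"); // both modes cache the task in memory
        if (async) {
            // Async: also persist the task so its state can be polled later,
            // then return the task id right away (training would continue elsewhere).
            taskIndex.put(taskId, "RUNNING");
            return taskId;
        }
        // Sync: nothing is written to the index; train and return the model id.
        String modelId = "model-" + counter; // placeholder for real training
        taskCache.put(taskId, "COMPLETED");
        return modelId;
    }

    public static void main(String[] args) {
        TaskStorageSketch runner = new TaskStorageSketch();
        System.out.println("sync returned:  " + runner.run(false));
        System.out.println("async returned: " + runner.run(true));
        System.out.println("persisted tasks: " + runner.taskIndex.keySet());
    }
}
```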

Zhangxunmt

Approved with a few questions in the comments.

ylwu-amzn requested a review from a team January 19, 2022 10:01

support train ML model in either sync or async way

7291511

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

ylwu-amzn force-pushed the train_sync_way branch from 20b2dcc to 7291511 Compare January 19, 2022 11:31

ylwu-amzn requested review from Zhangxunmt and jackiehanyang January 19, 2022 21:55

Zhangxunmt reviewed Jan 20, 2022


Zhangxunmt approved these changes Jan 20, 2022


ylwu-amzn merged commit 1d5da1d into opensearch-project:main Jan 21, 2022

ylwu-amzn mentioned this pull request Mar 9, 2022

Async task framework #100

Closed

ylwu-amzn added the enhancement and feature labels and removed the enhancement label Mar 11, 2022
