feat: implement workflow evaluation. #292

peterhuene · 2025-01-14T16:54:34Z

This PR implements workflow evaluation in `wdl-engine` and in the `wdl run` command.

Workflow evaluation works by taking the workflow evaluation graph and "splitting" it into subgraphs at each entry into a conditional or scatter statement.

For conditional statements, their subgraphs are only evaluated if the conditional expression evaluates to true.

For scatter statements, their subgraphs are evaluated for each element in the scatter array.
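
For readers following along, here is a minimal sketch of the subgraph idea described above. It is not the `wdl-engine` implementation: the `Node` type, `evaluate`, and `example_trace` are hypothetical names, and the conditional expression is modeled as an already-evaluated `bool`.

```rust
/// A simplified stand-in for a workflow graph node (hypothetical; not the
/// `wdl-engine` types).
enum Node {
    /// A leaf such as a declaration or call; here it only carries a name.
    Leaf(&'static str),
    /// A conditional: the subgraph is evaluated only when `condition` is true.
    Conditional { condition: bool, body: Vec<Node> },
    /// A scatter: the subgraph is evaluated once per element of `items`.
    Scatter { items: Vec<i64>, body: Vec<Node> },
}

/// Evaluates `nodes` in order, appending a trace entry for every leaf visited.
fn evaluate(nodes: &[Node], scope: &str, trace: &mut Vec<String>) {
    for node in nodes {
        match node {
            Node::Leaf(name) => trace.push(format!("{scope}{name}")),
            Node::Conditional { condition, body } => {
                // The conditional's subgraph is skipped entirely when false.
                if *condition {
                    evaluate(body, scope, trace);
                }
            }
            Node::Scatter { items, body } => {
                // The scatter's subgraph runs once for each array element.
                for item in items {
                    let inner = format!("{scope}[{item}]");
                    evaluate(body, &inner, trace);
                }
            }
        }
    }
}

/// Builds a tiny example workflow and returns its evaluation trace.
fn example_trace() -> Vec<String> {
    let workflow = vec![
        Node::Leaf("a"),
        Node::Conditional { condition: false, body: vec![Node::Leaf("skipped")] },
        Node::Scatter { items: vec![1, 2], body: vec![Node::Leaf("b")] },
    ];
    let mut trace = Vec::new();
    evaluate(&workflow, "", &mut trace);
    trace
}
```

In this sketch the false conditional contributes nothing to the trace, while the scatter body appears once per element.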

Workflow evaluation is currently single-threaded. Evaluation makes as much progress on the graph as it can; once no more progress is possible, it waits for outstanding tasks to complete.
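
The "make progress, then wait" loop can be sketched roughly as below. This is an illustration under stated assumptions, not the code in this PR: the `Node` struct, `run`, and `example_log` are hypothetical, and task completion is simulated by popping a queue rather than by awaiting real task execution.

```rust
use std::collections::{HashSet, VecDeque};

/// A node in a toy evaluation graph (hypothetical; not the `wdl-engine` types).
struct Node {
    name: &'static str,
    /// Indexes of nodes that must complete first.
    deps: Vec<usize>,
    /// Whether the node is a task call that completes asynchronously.
    is_task: bool,
}

/// Drives the graph to completion and returns a log of what happened.
fn run(nodes: &[Node]) -> Vec<String> {
    let mut log = Vec::new();
    let mut done: HashSet<usize> = HashSet::new();
    let mut started: HashSet<usize> = HashSet::new();
    // Tasks that have been started but have not yet completed.
    let mut outstanding: VecDeque<usize> = VecDeque::new();

    while done.len() < nodes.len() {
        // Make as much progress as possible on nodes whose dependencies are done.
        let mut progressed = true;
        while progressed {
            progressed = false;
            for (i, node) in nodes.iter().enumerate() {
                if started.contains(&i) || !node.deps.iter().all(|d| done.contains(d)) {
                    continue;
                }
                started.insert(i);
                progressed = true;
                if node.is_task {
                    log.push(format!("start {}", node.name));
                    outstanding.push_back(i);
                } else {
                    log.push(format!("eval {}", node.name));
                    done.insert(i);
                }
            }
        }

        // No more progress can be made: wait for an outstanding task.
        match outstanding.pop_front() {
            Some(i) => {
                log.push(format!("wait {}", nodes[i].name));
                done.insert(i);
            }
            // Nothing is outstanding: either everything is done or the
            // remaining nodes can never run (for example, a cycle).
            None => break,
        }
    }
    log
}

/// Two tasks depend on `x`; `y` depends on both tasks.
fn example_log() -> Vec<String> {
    let nodes = [
        Node { name: "x", deps: vec![], is_task: false },
        Node { name: "t1", deps: vec![0], is_task: true },
        Node { name: "t2", deps: vec![0], is_task: true },
        Node { name: "y", deps: vec![1, 2], is_task: false },
    ];
    run(&nodes)
}
```

Both tasks are started before the first wait, which is the point of exhausting available progress before blocking.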

Currently local task execution is limited to the amount of available parallelism supported on the host system; in the future, we'll want that to be configurable.
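
As a rough sketch of what a configurable limit might look like, the function below uses the standard library's `std::thread::available_parallelism` as the default. The `configured` parameter and the choice to cap it at the host value are assumptions for illustration only; nothing in this PR defines such a setting.

```rust
use std::num::NonZeroUsize;
use std::thread;

/// Picks the maximum number of concurrently running local tasks.
///
/// `configured` is a hypothetical user setting; when absent, fall back to the
/// parallelism reported by the host, and to 1 if that cannot be determined.
fn max_concurrency(configured: Option<usize>) -> usize {
    let host = thread::available_parallelism()
        .map(NonZeroUsize::get)
        .unwrap_or(1);
    match configured {
        // Treat 0 as "unset" and, in this sketch, never exceed the host value.
        Some(n) if n > 0 => n.min(host),
        _ => host,
    }
}
```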

Before submitting this PR, please make sure:

You have added a few sentences describing the PR here.
You have added yourself or the appropriate individual as the assignee.
You have added at least one relevant code reviewer to the PR.
Your code builds clean without any errors or warnings.
You have added tests (when appropriate).
You have updated the README or other documentation to account for these changes (when appropriate).
You have added an entry to the relevant CHANGELOG.md (see "keep a changelog" for more information).
Your commit messages follow the conventional commit style.

peterhuene · 2025-01-14T19:09:51Z

The test error is:

error: call to function `read_int` failed: file `calls/count_lines/stdout` does not contain an integer value on a single line
   ┌─ tests/workflows/task-outputs/source.wdl:27:22
   │
27 │     Int line_count = read_int(stdout())
   │                      ^^^^^^^^

Looking into it (probably yet another line ending difference issue).

peterhuene · 2025-01-14T19:17:17Z

Not a bug; the WDL test source isn't quoting the path in the task's command, causing the backslashes in the path on Windows to be interpreted as escape sequences.

The failure happened inside a piped command that did not set `pipefail`, so the task itself succeeded, but the `read_int` call failed because the output file was empty.

I'll push up a fix momentarily.

The fix is to properly quote the file that is being line counted in the task so that the backslashes in the path aren't misinterpreted.
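
To illustrate why quoting matters, here is a small sketch of single-quoting a path before placing it in a shell command. It is not the fix applied to the WDL test source; `shell_quote` and `count_lines_command` are hypothetical helpers, and the `cat ... | wc -l` pipeline is only an assumed shape for a line-counting command.

```rust
/// Quotes `arg` for a POSIX-style shell by wrapping it in single quotes.
///
/// Inside single quotes the shell treats every character literally, including
/// backslashes; an embedded single quote is emitted as `'\''`.
fn shell_quote(arg: &str) -> String {
    format!("'{}'", arg.replace('\'', r"'\''"))
}

/// Builds a line-counting command for `path` (a hypothetical helper).
fn count_lines_command(path: &str) -> String {
    // In bash, `set -o pipefail` makes the pipeline fail if any stage fails,
    // not only the last one.
    format!("set -o pipefail; cat {} | wc -l", shell_quote(path))
}
```

Without the quotes, a Windows path such as `C:\tmp\new.txt` would have its backslashes consumed by the shell before the command ever saw the file name.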

claymcleod

💯

wdl-engine/tests/workflows/array-map-equality/source.wdl

wdl-engine/tests/workflows/call-imported-task/source.wdl

wdl-engine/tests/workflows/nested-scatter/source.wdl

a-frantz · 2025-01-15T20:46:32Z

wdl-engine/tests/workflows/placeholder-none/source.wdl

+    # The expression in this string results in an error (calling `select_first` on an array
+    # containing no non-`None` values) and so the placeholder evaluates to the empty string and
+    # `s` evaluates to: "Foo is "
+    String s = "Foo is ~{select_first([foo])}"


I'd expect this to fail at runtime, but because the failure is nested inside a placeholder, it instead just produces an empty string? That's an interesting side effect that could lead to hard-to-find bugs.

That's right: if evaluation fails because of a `None` value, the placeholder doesn't fail; it evaluates to the empty string.

If an expression within a placeholder evaluates to None, and either causes the entire placeholder to evaluate to None or causes an error, then the placeholder is replaced by the empty string.

I was aware of the first part but not the error clause. That seems problematic to me.
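
The rule under discussion can be modeled with a small sketch. This is a hypothetical model for illustration, not the `wdl-engine` evaluator: `PlaceholderEval`, `render_placeholder`, and `interpolate` are made-up names, and the decision to propagate errors unrelated to `None` is an assumption.

```rust
/// Outcome of evaluating the expression inside a placeholder (hypothetical
/// model; not the `wdl-engine` types).
enum PlaceholderEval {
    /// The expression produced a value.
    Value(String),
    /// The whole expression evaluated to `None`.
    NoneValue,
    /// Evaluation failed because of a `None` value, e.g. `select_first` on an
    /// array with no non-`None` elements.
    ErrorFromNone,
    /// Evaluation failed for an unrelated reason.
    OtherError(String),
}

/// Renders a single placeholder according to the rule quoted above.
fn render_placeholder(eval: PlaceholderEval) -> Result<String, String> {
    match eval {
        PlaceholderEval::Value(v) => Ok(v),
        // Both the `None` result and the `None`-caused error become "".
        PlaceholderEval::NoneValue | PlaceholderEval::ErrorFromNone => Ok(String::new()),
        // Assumption: other errors still fail the enclosing string.
        PlaceholderEval::OtherError(msg) => Err(msg),
    }
}

/// Interpolates one placeholder after a literal prefix.
fn interpolate(prefix: &str, eval: PlaceholderEval) -> Result<String, String> {
    Ok(format!("{prefix}{}", render_placeholder(eval)?))
}
```

Under this model the string from the test evaluates to `"Foo is "` with no error raised, which is the silent behavior being questioned.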

peterhuene requested review from adthrasher, claymcleod and a-frantz January 14, 2025 16:54

peterhuene self-assigned this Jan 14, 2025

claymcleod reviewed Jan 14, 2025

wdl-engine/src/backend/local.rs

claymcleod reviewed Jan 14, 2025

wdl-engine/src/eval/v1/task.rs

fix: fix the task-outputs workflow test on Windows.

81e186c

claymcleod reviewed Jan 14, 2025

wdl-engine/tests/workflows/test-input-keyword/source.wdl

claymcleod approved these changes Jan 14, 2025

a-frantz reviewed Jan 14, 2025

wdl-analysis/tests/analysis/call-unknown-input/source.diagnostics

wdl-engine/src/eval/v1/task.rs

peterhuene and others added 4 commits January 14, 2025 14:41

Update wdl-engine/tests/workflows/test-input-keyword/source.wdl

b4bb5ce

Co-authored-by: Clay McLeod <3411613+claymcleod@users.noreply.github.com>

Update wdl-engine/src/eval/v1/task.rs

5a6b56b

Co-authored-by: Andrew Frantz <andrew.frantz@stjude.org>

chore: code review feedback.

d589912

chore: update CHANGELOGs.

e9142ac

a-frantz reviewed Jan 15, 2025

a-frantz approved these changes Jan 15, 2025

peterhuene merged commit 120ecc0 into stjude-rust-labs:main Jan 15, 2025
16 checks passed

peterhuene deleted the workflow-eval branch January 15, 2025 21:14
