In-graph synchronization with Epochs #86

fknorr · 2022-02-07T13:54:06Z

So far, queue::slow_full_sync() and the shutdown of worker executors are implemented by sending SYNC and SHUTDOWN commands through broadcast_control_command() and then busy-waiting on the main thread. Since we already introduced synchronization logic to task and command graphs before as part of horizons, we now have the opportunity to integrate SYNC and SHUTDOWN into the graphs as well and reduce specialized code paths. This will also be a stepping stone towards using explicit synchronization to exchange data in celerity buffers with the main thread.

This PR represents synchronization points in the graph with epochs, which themselves are a generalization of INIT/NOP tasks and commands. Each graph node (except the first epoch) has exactly one preceding epoch, and no node can ever depend on a node before its epoch. In that sense they are similar to horizons, except that they fully serialize execution and dependency tracking:

To ensure correct temporal ordering, a new epoch node receives a true-dependency on the entire execution front (orange edges); and all nodes without other true-dependencies (pure producers, tasks a b d e) receive a dependency on their epoch (pink edges). A new epoch will immediately become the last-writer and host-buffer initializer for all successor tasks. Like any other node, epochs can be subsumed by horizons, after which the subsuming horizon becomes the epoch for all following nodes.

Inserting a new epoch task with task_manager::finish_epoch() will create one epoch command per node, with dependencies on the current execution front. Each epoch_command will issue one epoch_job, which awaits the completion of said dependencies, optionally issues an MPI_Barrier and then notifies the task manager via notify_epoch_completed(). Other threads can await this notification to block the main thread in queue::slow_full_sync() and ~runtime().

The nodes preceding a task epoch are pruned once the first task is created after notify_epoch_completed(). In the command graph generator, nodes can be deleted as soon as they lie behind the last epoch, as they can't be depended on by any successor task. In that regard, epochs act like immediately-applied horizons.

This PR also recognizes that the task manager can prune all nodes preceding a horizon A as soon as it has been notified that horizon B with A → B has been reached. This notification signifies that all commands of tasks preceding A have finished executing, so even though they might still be referenced from already-executed commands in the command graph, the task data never has to be accessed again.

To aid debugging, Celerity will now also track the origin of a dependency in addition to its kind. This is currently visualized in generated dot graphs.

The old synchronization infrastructure (broadcast_control_command(), sync_id, ...) has been removed.

fknorr · 2022-02-22T13:22:02Z

From live discussion with @PeterTh: We need to re-visit an earlier concern about race conditions on tasks returned form task_manager::get_task(): The assumption was that this method can return a const task* that can be accessed without a lock since tasks don't change after creation. However, their dependencies are modified from ~intrusive_graph_node(), so that is not actually true. This is relevant for the PR in question since reducing the pruning delay from 2 horizons to 1 might cause conflicting accesses on the current epoch task.

include/command.h

include/task.h

include/task_manager.h

src/graph_generator.cc

src/graph_serializer.cc

src/runtime.cc

src/task_manager.cc

test/graph_compaction_tests.cc

include/log.h

psalz

I'm only about a third through the changes so far; I've added some minor notes. However I did a quick test and it seems like there is a ~10% performance drop when running wave_sim -N 4096 -T 200 with 4 workers on gpuc1: Current master takes 522ms on average (over 5 runs), where this PR takes 585ms on average.

include/command.h

include/command_graph.h

include/graph_generator.h

include/log.h

src/executor.cc

include/command.h

src/runtime.cc

src/graph_serializer.cc

fknorr · 2022-03-15T12:22:23Z

I have collected longer-running benchmark results on the wave_sim behavior, since previous graph-generation timings turned out to be spurious. It appears that the runtime is usually pretty stable but produces rare and pronounced outliers. This happens on both master and this PR, so I believe we cannot attribute their existence to the changes in this PR.

The following numbers were collected by running wave_sim on 4 nodes, repeated 1000 times, while measuring the run time surrounding all kernel submissions after slow_full_sync():

The outliers skew the mean, but by eliminating them (or using the median), we see that the "normal" run-time behavior has not changed noticeably.

ORDER_DEP was handled like TRUE_DEP everywhere, and it does not add any value except for context information in printed graphs. This adds a dependency_origin enum to distinguish data flow dependencies from epoch/horizon/collective temporal dependencies.

…d behavior

… members in task_manager and graph_generator

…not need to specialize the first task

psalz

Thanks again, I've added a few more notes.

The only thing that I still find somewhat confusing is that epochs are both an abstract concept and a concrete type of task/command; in that horizons can also act as epochs, and there are some fields that are named *epoch*, but can also point to a horizon task/command. Can you outline to me again briefly why you decided against treating horizons as a type of epoch?

test/test_utils.h

include/command.h

include/graph_generator.h

src/graph_generator.cc

src/task_manager.cc

test/graph_generation_tests.cc

src/graph_generator.cc

BlackMark29A

Looks good to me. The only thing I found was a typo that wasn't even introduced by this PR.

test/graph_compaction_tests.cc

fknorr · 2022-04-27T15:48:59Z

Thanks to all of you for the continued endurance on this!

fknorr requested review from psalz and PeterTh February 7, 2022 13:54

fknorr force-pushed the epochs branch 4 times, most recently from 583104d to 3a53747 Compare February 21, 2022 14:57

fknorr mentioned this pull request Feb 21, 2022

Capture buffer and host-object data on synchronization points #94

Closed

fknorr force-pushed the epochs branch 2 times, most recently from 59307c8 to 24f4c12 Compare February 22, 2022 10:06

PeterTh assigned fknorr Feb 28, 2022

PeterTh reviewed Mar 1, 2022

View reviewed changes

fknorr mentioned this pull request Mar 2, 2022

Introduce less restrictive side effect orders #101

Closed

fknorr force-pushed the epochs branch from 928f0af to 98d4692 Compare March 2, 2022 10:11

BlackMark29A reviewed Mar 3, 2022

View reviewed changes

include/log.h Show resolved Hide resolved

include/log.h Outdated Show resolved Hide resolved

psalz reviewed Mar 4, 2022

View reviewed changes

psalz reviewed Mar 7, 2022

View reviewed changes

include/command.h Outdated Show resolved Hide resolved

src/runtime.cc Outdated Show resolved Hide resolved

src/graph_serializer.cc Outdated Show resolved Hide resolved

src/graph_serializer.cc Outdated Show resolved Hide resolved

fknorr force-pushed the epochs branch 12 times, most recently from 36454f2 to c19e7f9 Compare March 14, 2022 14:03

fknorr and others added 17 commits March 30, 2022 20:01

Insert new epochs into the task graph to serialize execution

caf4986

Prune task and command graph on epoch completion

b8186b6

Implement shutdown/sync on top of epochs

9ec6f71

Clean up and document epochs implementation

70e7f4b

Only delay task deletion by 1 horizon instead of 2

9afbc9e

Test: Multiple epochs without intermediate tasks/commands have define…

44c0625

…d behavior

Remove unused control-command related declarations from runtime

857a8b8

Tighten invariants on command_pkg

f3bdcb2

Adress reviewer comments

d949746

Rename task manager submission / notification methods

364d0c1

Clarify the refactored nop_command special-casing in graph_serializer

ab8e195

Refactor graph_generator::build_task, improve naming of epoch/horizon…

ba6afd4

… members in task_manager and graph_generator

Document "dangling nodes after horizon pruning" CDAG phenomenon

7321e4a

Generate true dependencies from AWAIT PUSH commands to the epoch

1dfc3f1

With epoch-dependencies on the init epoch, graph_compaction_tests do …

08e7e6f

…not need to specialize the first task

Include shutdown epoch generation in graph benchmarks

2248f81

fknorr force-pushed the epochs branch from 444af63 to 2248f81 Compare March 30, 2022 18:01

Remove unused scheduler::is_idle()

1cde23e

fknorr mentioned this pull request Mar 31, 2022

Improve scheduler performance by reducing lock contention #111

Merged

psalz requested changes Apr 4, 2022

View reviewed changes

fknorr mentioned this pull request Apr 22, 2022

Replace single-use MPI datatypes with binary serialization #114

Merged

psalz approved these changes Apr 25, 2022

View reviewed changes

BlackMark29A approved these changes Apr 27, 2022

View reviewed changes

test/graph_compaction_tests.cc Outdated Show resolved Hide resolved

Address reviewer comments on epochs

df2c5ba

fknorr force-pushed the epochs branch from 2eb6c5d to df2c5ba Compare April 27, 2022 15:11

fknorr merged commit 61dd07e into master Apr 27, 2022

fknorr deleted the epochs branch April 27, 2022 15:49

psalz mentioned this pull request May 12, 2022

Update benchmark results for epochs #117

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

In-graph synchronization with Epochs #86

In-graph synchronization with Epochs #86

fknorr commented Feb 7, 2022 •

edited

Loading

fknorr commented Feb 22, 2022

psalz left a comment

fknorr commented Mar 15, 2022

psalz left a comment

BlackMark29A left a comment

fknorr commented Apr 27, 2022

In-graph synchronization with Epochs #86

In-graph synchronization with Epochs #86

Conversation

fknorr commented Feb 7, 2022 • edited Loading

fknorr commented Feb 22, 2022

psalz left a comment

Choose a reason for hiding this comment

fknorr commented Mar 15, 2022

psalz left a comment

Choose a reason for hiding this comment

BlackMark29A left a comment

Choose a reason for hiding this comment

fknorr commented Apr 27, 2022

fknorr commented Feb 7, 2022 •

edited

Loading