Benchmark TDAG/CDAG generation in isolation #107

fknorr · 2022-03-12T15:32:36Z

This PR is similar to #62 in that it benchmarks various graph topologies. However, it only measures graph generation and serialization as they happen on the master node, without actually passing commands on to workers. It can therefore be run on a single node and timed more precisely, but it does not benchmark job generation or execution at all.

[task-graph] benchmarks measure TDAG generation.
[command-graph] benchmarks measure TDAG + CDAG generation and serialization in the same thread.
[scheduler] benchmarks are like [command-graph], but use a dedicated scheduler thread, similar to the runtime.

I shamelessly copied the artificial graph topologies from @PeterTh's #62 and added real-world graph examples from wave_sim as well as an (imaginary) Jacobi solver implementation.

Benchmarking the scheduler thread required some refactoring so that its std::thread can be re-used, which keeps thread creation overhead out of the timings.
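
As a rough illustration of the thread re-use idea (not the actual code in include/scheduler.h), the sketch below keeps a single std::thread alive and hands it jobs one at a time, so a benchmark loop pays for thread creation once, outside the timed region. The class name `reusable_thread` and its `run` method are hypothetical, and the sketch assumes a single caller.

```cpp
#include <cassert>
#include <condition_variable>
#include <functional>
#include <mutex>
#include <thread>
#include <utility>

// Hypothetical helper, not part of the PR: owns one long-lived thread and
// runs submitted jobs on it, one at a time.
class reusable_thread {
  public:
    reusable_thread() : m_thread([this] { loop(); }) {}

    ~reusable_thread() {
        {
            std::lock_guard<std::mutex> lock(m_mutex);
            m_shutdown = true;
        }
        m_cv.notify_all();
        m_thread.join();
    }

    reusable_thread(const reusable_thread&) = delete;
    reusable_thread& operator=(const reusable_thread&) = delete;

    // Hands a job to the already-running thread and blocks until it has finished.
    // Assumes that only one caller uses this object at a time.
    void run(std::function<void()> job) {
        assert(job);
        std::unique_lock<std::mutex> lock(m_mutex);
        m_job = std::move(job);
        m_done = false;
        m_cv.notify_all();
        m_cv.wait(lock, [this] { return m_done; });
    }

  private:
    void loop() {
        std::unique_lock<std::mutex> lock(m_mutex);
        for (;;) {
            m_cv.wait(lock, [this] { return m_shutdown || m_job; });
            if (m_shutdown) return;
            auto job = std::move(m_job);
            m_job = nullptr;
            lock.unlock(); // do not hold the lock while the job runs
            job();
            lock.lock();
            m_done = true;
            m_cv.notify_all();
        }
    }

    std::mutex m_mutex;
    std::condition_variable m_cv;
    std::function<void()> m_job;
    bool m_done = false;
    bool m_shutdown = false;
    std::thread m_thread; // declared last so the members above exist before the thread starts
};
```

A benchmark would construct `reusable_thread` once during setup and call `run` inside the measured loop, so only the hand-off and the job itself are timed.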

fknorr · 2022-03-16T10:10:06Z

I have simplified how the benchmark tree graphs are generated and introduced more complex communication patterns. The correctness of the generated graphs can be verified through the [debug-graphs] test cases.
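
To make the idea of a tree topology and a correctness check concrete, here is a minimal, self-contained sketch. It does not use the Celerity API or reproduce the benchmark code: the edge-list representation and the names `make_expanding_tree`, `reverse_edges` and `has_tree_in_degrees` are assumptions made for illustration only.

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical representation: each edge is (producer task, consumer task).
using edge_list = std::vector<std::pair<std::size_t, std::size_t>>;

// Builds a complete binary tree in which every inner task feeds two children.
// Tasks are numbered breadth-first with the root at 0; a tree of the given
// depth has 2^(depth+1) - 1 tasks.
edge_list make_expanding_tree(std::size_t depth) {
    edge_list edges;
    const std::size_t num_inner = (std::size_t{1} << depth) - 1; // tasks that have children
    for (std::size_t parent = 0; parent < num_inner; ++parent) {
        edges.emplace_back(parent, 2 * parent + 1);
        edges.emplace_back(parent, 2 * parent + 2);
    }
    return edges;
}

// Flips every edge, turning an expanding tree into a contracting one.
edge_list reverse_edges(edge_list edges) {
    for (auto& e : edges) std::swap(e.first, e.second);
    return edges;
}

// Checks a necessary condition for a tree rooted at task 0: the root has no
// producer and every other task has exactly one.
bool has_tree_in_degrees(const edge_list& edges, std::size_t num_tasks) {
    if (num_tasks == 0) return edges.empty();
    std::vector<std::size_t> in_degree(num_tasks, 0);
    for (const auto& e : edges) {
        if (e.first >= num_tasks || e.second >= num_tasks) return false;
        ++in_degree[e.second];
    }
    if (in_degree[0] != 0) return false;
    for (std::size_t i = 1; i < num_tasks; ++i) {
        if (in_degree[i] != 1) return false;
    }
    return true;
}
```

The in-degree check is deliberately cheap. It would not catch every malformed graph, but it does reject a contracting tree that is mistaken for an expanding one.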

include/scheduler.h

psalz

Thanks, LGTM! Given the precedent set by #100, this should also include updated benchmark results for ci/perf.

test/benchmarks.cc

fknorr requested review from psalz and PeterTh March 12, 2022 15:32

fknorr self-assigned this Mar 12, 2022

fknorr force-pushed the non-rt-graph-benchmarks branch from ec5b728 to 7b9f1cf March 12, 2022 16:41

PeterTh mentioned this pull request Mar 14, 2022

Profile and optimize frontend (from CGF submission to serialized graph) #108

Closed

fknorr force-pushed the non-rt-graph-benchmarks branch from da20e27 to 5c77d70 March 16, 2022 10:13

PeterTh reviewed Mar 16, 2022

include/scheduler.h (outdated, resolved)

fknorr added 3 commits March 16, 2022 21:02

Benchmark task and command graph generation

848ff84

Re-use scheduler thread to avoid benchmarking thread creation overhead

e5f108b

Add Jacobi solver graph topology to benchmarks

f7939d9

fknorr force-pushed the non-rt-graph-benchmarks branch from 9abd694 to e172340 March 16, 2022 20:02

PeterTh approved these changes Mar 17, 2022

psalz approved these changes Mar 23, 2022

test/benchmarks.cc (outdated, resolved)

test/benchmarks.cc (outdated, resolved)

fknorr added 5 commits March 26, 2022 08:53

Benchmark scheduler thread against throttled task submission

bbee0b7

Verify correct generation of tree topologies, add communication, log benchmark graphs

5744e4d

Move scheduler threading complexity from runtime to tests

56eff3f

Update benchmark results

a2c9305

DAG benchmark improve naming of tree topologies

5ad7bab

fknorr force-pushed the non-rt-graph-benchmarks branch from 2fd8290 to 5ad7bab March 26, 2022 07:54

fknorr merged commit 51f5bc5 into master Mar 26, 2022

fknorr deleted the non-rt-graph-benchmarks branch July 19, 2022 09:47
