-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update README.md #8
base: main
Are you sure you want to change the base?
Conversation
This patch adds a script for running compute-benchmarks, https://github.com/intel/compute-benchmarks/, and a corresponding GH Actions workflow that runs those benchmarks when prompted to do so with a comment, like so: /benchmarks-level-zero --env UR_L0_IMMEDIATE_COMMANDLISTS_BATCH_EVENT_COMPLETIONS=1 Additional arguments can be appended to the end of the line. After the build if finished, the results will be presented through a comment. For now, this runs only a single scenario, api_overhead_benchmark_sycl with SubmitKernel test, but will expand over time to cover more.
…ation functions" This reverts commit bbb04b6.
/benchmarks-level-zero --env UR_L0_IMMEDIATE_COMMANDLISTS_BATCH_EVENT_COMPLETIONS=1 --save baseline |
Compute Benchmarks L0 run: |
Compute Benchmarks L0 run: Benchmark ResultsChartxychart-beta ComparisonComparison data not found. No comparison performed. DetailsBatched In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Batched Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type |
/benchmarks-level-zero --env UR_L0_IMMEDIATE_COMMANDLISTS_BATCH_EVENT_COMPLETIONS=1 --compare baseline |
Compute Benchmarks L0 run: |
Compute Benchmarks L0 run: Benchmark ResultsChartxychart-beta
title "api_overhead_benchmark_sycl (lower is better)"
x-axis ["Batched In Order", "Batched Out Of Order", "Immediate In Order", "Immediate Out Of Order"]
y-axis "mean execution time per 10 kernels (in μs)" 0 --> 100.0
bar [49.456, 33.692, 27.009, 26.39]
line [52.319, 34.515, 53.157, 34.171]
ComparisonComparison with previous data:
DetailsBatched In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Batched Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_l0 --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type |
/benchmarks-level-zero --save baseline |
Compute Benchmarks L0 run: |
Compute Benchmarks L0 run: |
/benchmarks-level-zero --save baseline |
Compute Benchmarks L0 run: |
Compute Benchmarks L0 run: Benchmark ResultsChartxychart-beta
title "api_overhead_benchmark_sycl (lower is better)"
x-axis ["Batched In Order", "Batched Out Of Order", "Immediate In Order", "Immediate Out Of Order"]
y-axis "mean execution time per 10 kernels (in μs)" 0 --> 100.0
bar [26.188, 44.666, 30.889, 48.461]
line [26.188, 44.666, 30.889, 48.461]
ComparisonComparison with previous data:
DetailsBatched In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Batched Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type |
/benchmarks-level-zero --env UR_L0_IMMEDIATE_COMMANDLISTS_BATCH_EVENT_COMPLETIONS=1 |
Compute Benchmarks L0 run: |
Compute Benchmarks L0 run: Benchmark ResultsChartxychart-beta
title "api_overhead_benchmark_sycl (lower is better)"
x-axis ["Batched In Order", "Batched Out Of Order", "Immediate In Order", "Immediate Out Of Order"]
y-axis "mean execution time per 10 kernels (in μs)" 0 --> 100.0
bar [46.693, 44.905, 53.893, 49.958]
line [26.188, 44.666, 30.889, 48.461]
ComparisonComparison with previous data:
DetailsBatched In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Batched Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=0 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate In OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=1 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type Immediate Out Of OrderClick to expandEnvironment Variables:UR_L0_USE_IMMEDIATE_COMMANDLISTS=1 Command:/home/pmdk/actions-runner/_work/unified-runtime/unified-runtime/compute-benchmarks-build/bin//api_overhead_benchmark_sycl --test=SubmitKernel --Ioq=0 --DiscardEvents=0 --MeasureCompletion=0 --iterations=10000 --Profiling=0 --NumKernels=10 --KernelExecTime=1 --csv --noHeaders Output:TestCase,Mean,Median,StdDev,Min,Max,Type |
efa690a
to
138f7f9
Compare
9eeeead
to
62234f1
Compare
6ca52a5
to
844c209
Compare
No description provided.