Improve performance of allocation profiling #287

r1viollet · 2023-07-17T07:12:15Z

What does this PR do?

- Adjust the pseudo random generator to be thread local
- Change the generator to be smaller
- Remove the lock from the allocation profiling path
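
A minimal sketch of how these three items could fit together. This is an illustration under stated assumptions, not the code in this PR: `ThreadSampler`, `thread_sampler()`, the `mean_bytes` parameter, and the choice of `std::minstd_rand` as the smaller generator are all hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <random>

// Hypothetical per-thread sampling state. Because each thread owns its own
// instance, the sampling decision needs no lock.
struct ThreadSampler {
  // std::minstd_rand keeps a single integer of state, whereas std::mt19937
  // keeps 624 32-bit words (roughly 2.5 KB).
  std::minstd_rand gen;
  int64_t remaining_bytes = 0;

  explicit ThreadSampler(uint32_t seed) : gen(seed) {}

  // Draw the distance (in bytes) to the next sample from an exponential
  // distribution with the requested mean. The draw is clamped so the cast
  // to an integer is always well defined.
  int64_t next_interval(double mean_bytes) {
    std::exponential_distribution<double> dist(1.0 / mean_bytes);
    const double draw = std::min(dist(gen), mean_bytes * 64.0);
    return static_cast<int64_t>(draw) + 1;
  }

  // Returns true when this allocation should be recorded.
  // Simplification: the overshoot past the sampling point is discarded.
  bool should_sample(size_t size, double mean_bytes) {
    assert(mean_bytes > 0);
    remaining_bytes -= static_cast<int64_t>(size);
    if (remaining_bytes > 0) {
      return false;
    }
    remaining_bytes = next_interval(mean_bytes);
    return true;
  }
};

// One sampler per thread, seeded once on first use.
inline ThreadSampler &thread_sampler() {
  thread_local ThreadSampler sampler{std::random_device{}()};
  return sampler;
}
```

In this sketch an allocation hook would call `thread_sampler().should_sample(size, mean_bytes)` and only enter the slower tracking path when it returns true.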

Credit for this idea goes to @nsavoire

Motivation

Improve performance of allocation profiling. This brings the cost of allocation profiling down to numbers that are comparable to running without tracking (about 1.4x in the benchmark below).

Benchmark                                  Time             CPU   Iterations
----------------------------------------------------------------------------
BM_ThreadedAllocations_NoTracking     343085 ns       172913 ns         4525
BM_ThreadedAllocations_Tracking       483263 ns       244706 ns         2886

The numbers were 4x worse prior to this change.

After this change I will need to focus on the deallocation code path.

Additional Notes

This needs extensive testing.
Risks are:

- Accuracy
- Thread safety

How to test the change?

The benchmark only tests parts of this. We will need to be careful when deploying this change.

richardstartin · 2023-07-25T15:57:01Z

src/lib/allocation_tracker.cc

@@ -194,6 +190,10 @@ void AllocationTracker::track_allocation(uintptr_t addr, size_t size,
  free_on_consecutive_failures(success);

  if (success && _state.track_deallocations) {
+    // \fixme{r1viollet} adjust set to be lock free


This could probably be replaced with a rather simple sparse bitset over the least significant 48 bits of the pointers, which would allow CAS operations on pages of the bitset rather than a lock.

r1viollet · 2023-07-26

I like this, especially as the impact of collisions is easy to manage. I'll probably make it smaller than that.
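
A rough sketch of the idea discussed in this thread, under assumptions that are not from the PR: `AddressBitset`, `kBits`, and the 4-bit alignment shift are hypothetical, the table is a small fixed-size one rather than a sparse bitset over 48 bits, and it uses atomic `fetch_or` / `fetch_and` instead of an explicit compare-and-swap loop.

```cpp
#include <array>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical lock-free set of tracked addresses. Each address hashes to
// one bit; two addresses may collide on the same bit.
class AddressBitset {
public:
  // Returns true if the bit was newly set. A false return means the
  // address is already tracked or collides with another tracked address,
  // in which case the caller can simply skip tracking this allocation.
  bool set(uintptr_t addr) {
    const size_t bit = index_of(addr);
    const uint64_t mask = uint64_t{1} << (bit % 64);
    const uint64_t prev =
        _words[bit / 64].fetch_or(mask, std::memory_order_acq_rel);
    return (prev & mask) == 0;
  }

  // Returns true if the bit was set before the call, i.e. the address
  // (or a colliding one) was being tracked.
  bool unset(uintptr_t addr) {
    const size_t bit = index_of(addr);
    const uint64_t mask = uint64_t{1} << (bit % 64);
    const uint64_t prev =
        _words[bit / 64].fetch_and(~mask, std::memory_order_acq_rel);
    return (prev & mask) != 0;
  }

private:
  static constexpr size_t kBits = size_t{1} << 16; // 8 KiB of bits
  static_assert((kBits & (kBits - 1)) == 0, "kBits must be a power of two");

  static size_t index_of(uintptr_t addr) {
    assert(addr != 0); // a null pointer is never a live allocation
    // Assumes allocations are at least 16-byte aligned, so the low 4 bits
    // carry no information and are dropped before masking.
    return (addr >> 4) & (kBits - 1);
  }

  std::array<std::atomic<uint64_t>, kBits / 64> _words{};
};
```

With this shape, a collision costs one untracked allocation rather than a correctness problem, which is one way to read "easy to manage" above.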

richardstartin · 2023-07-25T15:58:22Z

test/allocation_tracker-bench.cc

@@ -39,7 +44,7 @@ void perform_memory_operations(bool track_allocations,
  std::mt19937 gen(rd());

  for (auto _ : state) {
-    state.PauseTiming();
+    //    state.PauseTiming();


Why does this need to be commented out?

r1viollet · 2023-07-26

Initially I was measuring the cost of deallocation, whereas this PR optimizes the allocation code path.
I think the correct thing to do is to have separate benchmarks. I am continuing this work and will revisit this.

richardstartin

The change makes sense to me.

Improve performance of allocation profiling

a852e6e

- Adjust the pseudo random generator to be thread local
- Change the generator to be smaller
- Remove the lock from the allocation profiling path

r1viollet marked this pull request as ready for review July 17, 2023 09:00

r1viollet requested review from sanchda and nsavoire as code owners July 17, 2023 09:00

r1viollet assigned nsavoire and sanchda Jul 20, 2023

richardstartin reviewed Jul 25, 2023

richardstartin approved these changes Jul 25, 2023

sanchda approved these changes Jul 25, 2023

r1viollet merged commit 7ff7238 into main Jul 26, 2023

r1viollet deleted the r1viollet/perf_alloc_profiling_v1 branch July 26, 2023 08:57
