[GPU] Use array for tracking memory usage instead of map #25269

ialbrecht · 2024-06-27T20:50:37Z

Details:

Any additional locking or synchronization during memory allocation can have a negative impact on multithreaded (MT) execution.
`std::map` has slow access and requires a lock on every access. We can use `std::array` instead to hold a number of buckets that is known at compile time.
The array container has lower access latency and memory overhead.
We might be able to remove the mutex lock on stat collection.
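
As a rough illustration of the idea in the description, the sketch below keeps one atomic counter per allocation type in a fixed-size `std::array`, so updating a counter needs neither a map lookup nor a mutex. The names `alloc_kind` and `memory_stats` are hypothetical and are not taken from the plugin sources. Relaxed memory ordering is also an assumption, chosen here because the counters are pure statistics.

```cpp
#include <array>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical allocation categories; the plugin's real enum may differ.
// `count` is a sentinel that gives the number of buckets at compile time.
enum class alloc_kind : std::size_t { host = 0, device, shared, count };

class memory_stats {
    static constexpr std::size_t bucket_count =
        static_cast<std::size_t>(alloc_kind::count);

public:
    // Record an allocation. Each bucket is an independent atomic,
    // so no mutex is taken here.
    void add(alloc_kind kind, std::uint64_t bytes) {
        m_used[index_of(kind)].fetch_add(bytes, std::memory_order_relaxed);
    }

    // Record a deallocation.
    void sub(alloc_kind kind, std::uint64_t bytes) {
        m_used[index_of(kind)].fetch_sub(bytes, std::memory_order_relaxed);
    }

    // Read the current usage of one bucket.
    std::uint64_t used(alloc_kind kind) const {
        return m_used[index_of(kind)].load(std::memory_order_relaxed);
    }

private:
    // Map the enum value to an array index and check that it is in range.
    static std::size_t index_of(alloc_kind kind) {
        const auto index = static_cast<std::size_t>(kind);
        assert(index < bucket_count);
        return index;
    }

    // `{}` value-initializes every counter to zero.
    std::array<std::atomic<std::uint64_t>, bucket_count> m_used{};
};
```

Because the enum value is the index, a lookup is a single array access instead of a tree traversal, and the storage is one contiguous block with no per-node allocations.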

rkazants · 2024-06-28T03:31:47Z

build_jenkins

rkazants · 2024-06-28T03:40:33Z

Hi @ialbrecht,
any data with concrete numbers on how it can affect MT execution in your case?

src/plugins/intel_gpu/include/intel_gpu/runtime/engine.hpp

src/plugins/intel_gpu/src/runtime/engine.cpp

ialbrecht · 2024-07-01T14:51:42Z

> Hi @ialbrecht, any data with concrete numbers on how it can affect MT execution in your case?

We had benchmarks where memory allocation was on the critical performance path, and we were looking at all possible ways to optimize it.

src/plugins/intel_gpu/src/runtime/engine.cpp

`std::map` has slow access and requires a lock on every access. We can use `std::array` instead to hold a number of buckets that is known at compile time. This approach should have lower latency and memory overhead, and possibly needs no lock on access.


vladimir-paramuzov · 2024-07-26T09:36:23Z

build_jenkins

ialbrecht requested review from a team as code owners June 27, 2024 20:50

github-actions bot added the category: GPU OpenVINO GPU plugin label Jun 27, 2024

sys-openvino-ci added the ExternalPR External contributor label Jun 27, 2024

rkazants requested review from ilya-lavrenov and vladimir-paramuzov June 28, 2024 03:32

rkazants added the ExternalIntelPR External contributor from Intel label Jun 28, 2024

vladimir-paramuzov reviewed Jun 29, 2024

src/plugins/intel_gpu/include/intel_gpu/runtime/engine.hpp (outdated)

src/plugins/intel_gpu/src/runtime/engine.cpp

vladimir-paramuzov changed the title ~~Use array for tracking memory usage instead of map~~ [GPU] Use array for tracking memory usage instead of map Jul 3, 2024

vladimir-paramuzov approved these changes Jul 3, 2024

vladimir-paramuzov reviewed Jul 3, 2024

src/plugins/intel_gpu/src/runtime/engine.cpp (outdated)

ialbrecht added 5 commits July 26, 2024 09:56

[GPU] Remove device memory tracking mutex

623436c

No need to lock mutex to update atomic values

[GPU] added default initializer

8a24957

Make sure atomic values are zero initialized.

[GPU] Removed double increment on memory usage counter

f6b8a97

As in description

[GPU] Fix current mem size value fetch

0d12c25

`fetch_add` returns old value, therefore we need to increment current memory usage value before storing current max.
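
The last fix can be sketched as follows. `std::atomic::fetch_add` returns the value held before the addition, so the post-allocation usage is that return value plus the allocated size. The `usage_tracker` type, its member names, and the compare-exchange loop used to raise the peak are assumptions made for illustration, not the code merged in this PR. Both counters are explicitly initialized to zero, because a default-constructed `std::atomic` is left uninitialized before C++20.

```cpp
#include <atomic>
#include <cstdint>

// Hypothetical tracker for a single allocation type.
struct usage_tracker {
    std::atomic<std::uint64_t> current{0};  // bytes in use right now
    std::atomic<std::uint64_t> peak{0};     // highest value of `current` seen

    // Record an allocation and return the usage after it.
    std::uint64_t on_alloc(std::uint64_t bytes) {
        // fetch_add yields the OLD value, so add `bytes` to get the new one.
        const std::uint64_t now =
            current.fetch_add(bytes, std::memory_order_relaxed) + bytes;

        // Raise the peak if needed. On failure compare_exchange_weak reloads
        // `seen`, so the loop ends once the stored peak is already >= now.
        std::uint64_t seen = peak.load(std::memory_order_relaxed);
        while (seen < now &&
               !peak.compare_exchange_weak(seen, now, std::memory_order_relaxed)) {
        }
        return now;
    }

    // Record a deallocation; the peak is intentionally left untouched.
    void on_free(std::uint64_t bytes) {
        current.fetch_sub(bytes, std::memory_order_relaxed);
    }
};
```

Comparing the peak against the raw return value of `fetch_add` would leave it one allocation behind, which is the kind of error the commit message describes.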

vladimir-paramuzov force-pushed the memory_tracker_use_arrays branch from 47bc055 to 0d12c25 Compare July 26, 2024 05:56

vladimir-paramuzov added this pull request to the merge queue Jul 29, 2024

Merged via the queue into openvinotoolkit:master with commit 131c944 Jul 29, 2024
109 checks passed

vladimir-paramuzov added this to the 2024.4 milestone Jul 30, 2024
