update document for latency configuration for multi numa nodes on one socket #26944

wangleis · 2024-10-08T06:40:35Z

Details:

Update the documentation for latency configuration when one socket exposes multiple NUMA nodes.
The corresponding code change is in PR #26798, "update latency configuration for multi numa nodes on one socket".

Tickets:

CVS-140601

dmitry-gorokhov · 2024-10-14T07:19:34Z

...-inference/inference-devices-and-modes/cpu-device/performance-hint-and-thread-scheduling.rst

+| ``ov::hint::enable_hyper_threading`` | No                                                                    | No                                                                    |
+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+
+| ``ov::hint::enable_cpu_pinning``     | No / Not Supported                                                    | Yes except using P-cores and E-cores together                         |
+--------------------------------------+-----------------------------------------------------------------------+-----------------------------------------------------------------------+



I think it would be beneficial to have a dedicated subsection here describing that, starting from the 5th Xeon generation, NUMA nodes are exposed explicitly (with SNC=ON), ideally with a reference to an Intel resource with details, and that OV uses only a single NUMA node due to performance considerations. So using only part of the socket's cores is expected behavior.
We also need to mention that for some models (with big compute demand) the default behavior might not be optimal, so the recommendation is to try full-socket or even multi-socket execution for latency, and we should provide a recommendation on how to do that.

@dmitry-gorokhov updated. Please take a look.
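
For readers who want to check whether the single-NUMA-node behavior discussed above applies to their machine, here is a minimal sketch that lists the NUMA nodes Linux exposes and how many logical CPUs each one owns. It is not part of this PR. It assumes a Linux host with the standard sysfs layout under `/sys/devices/system/node`; the helper names `parse_cpulist` and `numa_nodes` are hypothetical and chosen for illustration only. It does not call OpenVINO and does not show how OpenVINO itself selects cores.

```python
from pathlib import Path


def parse_cpulist(cpulist: str) -> list[int]:
    """Expand a Linux cpulist string such as "0-3,8-11" into CPU ids."""
    cpus: list[int] = []
    for part in cpulist.strip().split(","):
        if not part:
            # Memory-only nodes have an empty cpulist.
            continue
        if "-" in part:
            start, end = part.split("-")
            cpus.extend(range(int(start), int(end) + 1))
        else:
            cpus.append(int(part))
    return cpus


def numa_nodes(sysfs_root: str = "/sys/devices/system/node") -> dict[int, list[int]]:
    """Map each NUMA node id to its logical CPU ids.

    Returns an empty dict when the sysfs directory is absent
    (for example on non-Linux systems).
    """
    nodes: dict[int, list[int]] = {}
    root = Path(sysfs_root)
    if not root.is_dir():
        return nodes
    for node_dir in root.glob("node[0-9]*"):
        cpulist_file = node_dir / "cpulist"
        if cpulist_file.is_file():
            node_id = int(node_dir.name[len("node"):])
            nodes[node_id] = parse_cpulist(cpulist_file.read_text())
    return nodes


if __name__ == "__main__":
    for node_id, cpus in sorted(numa_nodes().items()):
        print(f"node{node_id}: {len(cpus)} logical CPUs")
```

If the output shows more NUMA nodes than physical sockets, each node covers only a fraction of a socket's cores, which is the situation in which latency-mode inference on a single NUMA node would use only part of the socket.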

wangleis requested a review from a team as a code owner October 8, 2024 06:40

wangleis requested review from tsavina and removed request for a team October 8, 2024 06:40

github-actions bot added the category: docs OpenVINO documentation label Oct 8, 2024

wangleis assigned dmitry-gorokhov Oct 8, 2024

wangleis requested review from dmitry-gorokhov, sunxiaoxia2022 and a team October 8, 2024 06:41

wangleis mentioned this pull request Oct 8, 2024

update latency configuration for multi numa nodes on one socket #26798

Open

update docs for latency on numa node

10cbc5a

dmitry-gorokhov added this to the 2024.5 milestone Oct 14, 2024

dmitry-gorokhov reviewed Oct 14, 2024


wangleis added 2 commits October 16, 2024 08:12

update note

03e83f2

Merge branch 'master' into doc_latency_on_numa_node

669c41a
