Skip to content

Commit

Permalink
[DOCS] Updating NPU device article for master (#25734)
Browse files Browse the repository at this point in the history
Porting: #25727
  • Loading branch information
sgolebiewski-intel authored Jul 25, 2024
1 parent f6ca0e7 commit a9c8b99
Showing 1 changed file with 26 additions and 10 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -164,8 +164,8 @@ offer a limited set of supported OpenVINO features.

**ov::intel_npu::compilation_mode_params**

``ov::intel_npu::compilation_mode_params`` is an NPU-specific property that allows to
control model compilation for NPU.
``ov::intel_npu::compilation_mode_params`` is an NPU-specific property that allows
control of model compilation for NPU.

.. note::

Expand All @@ -176,7 +176,7 @@ Following configuration options are supported:

**optimization-level**

Defines a preset of optimization passes to be applied during compilation.
Defines an optimization effort hint to the compiler.

.. list-table::
:widths: 10 200
Expand All @@ -185,15 +185,15 @@ Defines a preset of optimization passes to be applied during compilation.
* - **Value**
- **Description**
* - 0
- Reduced subset of optimization passes. Smaller compile time.
- Reduced subset of optimization passes. May result in smaller compile time.
* - 1
- **Default.** Balanced performance/compile time.
* - 2
- Prioritize performance over compile time that may be an issue.

**performance-hint-override**

An extension for LATENCY mode being specified using ``ov::hint::performance_mode``
The LATENCY mode can be overridden by specifying ``ov::hint::performance_mode``
Has no effect for other ``ov::hint::PerformanceMode`` hints.

.. list-table::
Expand All @@ -207,15 +207,31 @@ Has no effect for other ``ov::hint::PerformanceMode`` hints.
* - latency
- Prioritize performance over power efficiency.

.. tab-set::
Usage example:

.. tab-item:: Usage example
.. code-block::
.. code-block::
map<str, str> config = {ov::intel_npu::compilation_mode_params.name(), ov::Any("optimization-level=1 performance-hint-override=latency")};
compile_model(model, config);
**npu_turbo**

The turbo mode, where available, provides a hint to the system to maintain the
maximum NPU frequency and memory throughput within the platform TDP limits.
The turbo mode is not recommended for sustainable workloads due to higher power
consumption and potential impact on other compute resources.

.. code-block::
core.set_property("NPU", ov::intel_npu::turbo(true));
or

.. code-block::
map<str, str> config = {ov::intel_npu::compilation_mode_params.name(), ov::Any("optimization-level=1 performance-hint-override=latency")};
core.compile_model(ov_model, "NPU", {ov::intel_npu::turbo(true)});
compile_model(model, config);
Limitations
#############################
Expand Down

0 comments on commit a9c8b99

Please sign in to comment.