
[Op][Spec] RMSNorm Operator Specification #23569

Closed

Conversation

@mitruska (Contributor) commented Mar 20, 2024

Details:

  • RMSNorm Operator Specification

To be discussed:

  • Scale input - optional input vs. a separate multiplication outside the formula - proposed as an optional input to comply with the existing GPU RMSNorm op
  • Axes as input - vector or scalar, input or attribute - proposed as a 1D/scalar axes input
  • compute_type - precision for the internal computation and accumulation (usually f32 for better results on lower precisions); either inside the op, or outside and implemented by Convert - proposed as an attribute to comply with the existing GPU RMSNorm (output_type); see the interface sketch below
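
For illustration, a minimal NumPy sketch of the proposed interface; the function name, defaults, and the axes handling are hypothetical, not part of the spec:

```python
import numpy as np

def rms_norm(x, axes, eps=1e-6, scale=None, compute_type=np.float32):
    # Hypothetical sketch of the proposed interface (not the OpenVINO API):
    #   axes         - scalar or 1D axes to reduce over (proposed as an input)
    #   scale        - optional per-channel scale (proposed as an optional input)
    #   compute_type - precision of the internal computation (proposed as an attribute)
    out_type = x.dtype
    xc = x.astype(compute_type)                    # upcast for internal accumulation
    axes = tuple(np.atleast_1d(axes).tolist())     # accept a scalar or a 1D vector
    rms = np.sqrt(np.mean(np.square(xc), axis=axes, keepdims=True) + eps)
    y = (xc / rms).astype(out_type)                # cast back before scaling
    return y * scale if scale is not None else y

y = rms_norm(np.ones((2, 4, 8), dtype=np.float16), axes=-1)
```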

Related GPU kernel and fusion transformation.

Related PRs:

Tickets:

  • 134914, discussion 129027

@mitruska mitruska requested a review from a team as a code owner March 20, 2024 11:17
@mitruska mitruska requested review from zKulesza and removed request for a team March 20, 2024 11:17
@mitruska mitruska self-assigned this Mar 20, 2024
@github-actions github-actions bot added the category: docs OpenVINO documentation label Mar 20, 2024
@mitruska mitruska added category: Opset OpenVINO Opset and removed category: docs OpenVINO documentation labels Mar 20, 2024
@github-actions github-actions bot added the category: docs OpenVINO documentation label Mar 20, 2024
* *compute_type*

* **Description**: The precision for internal computation, before scaling.
* **Range of values**: Supported floating-point types: "f32", "f16", ...
Contributor:

Any other types besides fp16 and fp32?

@mitruska (Contributor Author):


In the models I've seen, the cast is to f32. In general, any type could be allowed to match the Convert capabilities, but that may not be a real use case.

Comment on lines 23 to 30
.. math::

   (x / Sqrt(ReduceMean(x^2, axes) + eps))

- If the optional ``scale`` input is provided:

.. math::

   (x / Sqrt(ReduceMean(x^2, axes) + eps)) * scale
Contributor:


Is the final decision to have multiplication by x inside RMSNorm? Why?

@mitruska (Contributor Author):


The discussion I mentioned in the PR description is about having the scale inside or outside the formula.
And I proposed to keep it optional for compatibility with existing GPU RMSNorm op.

Could you please clarify: do you see other options for the RMSNorm formula?
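
To make the inside-vs-outside question concrete, a small NumPy check (illustrative only) showing that `scale` as an optional input is numerically equivalent to a scale-less RMSNorm followed by a separate Multiply, which is why the input can stay optional:

```python
import numpy as np

x = np.random.rand(2, 8).astype(np.float32)
scale = np.random.rand(8).astype(np.float32)
eps = 1e-6

rms = np.sqrt(np.mean(np.square(x), axis=-1, keepdims=True) + eps)
inside = (x / rms) * scale     # scale applied inside the RMSNorm formula
outside = (x / rms)            # scale-less RMSNorm ...
outside = outside * scale      # ... followed by a separate Multiply op
assert np.allclose(inside, outside)
```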

@mitruska mitruska requested a review from rkazants March 25, 2024 08:15
@mitruska mitruska added this to the 2024.2 milestone Mar 29, 2024
github-merge-queue bot pushed a commit that referenced this pull request Apr 15, 2024
### Details:
 - RMSNorm op core class
 - Registration in the opset and the op check (conformance) test will be added in the next PRs

Spec PR: 
- #23569

### Tickets:
 - 136261
alvoron pushed a commit to alvoron/openvino that referenced this pull request Apr 29, 2024


@github-actions github-actions bot added the Stale label May 7, 2024
@mitruska mitruska removed the Stale label May 7, 2024
@mitruska (Contributor Author) commented May 9, 2024

Ongoing discussion:

The "compute_type" attribute supposed to cover F16-->RMS(F32/BF16)-->F16 to enable fusing Converts at the beggining and at the end of the RMS subgraph.
But it doesn't cover patterns when the Convert is not the first op in the graph (current GPU case F32-->RMS(F32)-->F16), so the "output_type" attribute is needed when fusing only the final Convert (mentioned as important from the GPU performance perspective).
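
A minimal NumPy sketch of the two Convert-fusion patterns under discussion; the helper name and shapes are illustrative:

```python
import numpy as np

def rms(x, eps=1e-6):
    # Scale-less RMS normalization, computed in the precision of x
    return x / np.sqrt(np.mean(np.square(x), axis=-1, keepdims=True) + eps)

# Pattern covered by "compute_type": F16 --> RMS(F32) --> F16
# (Convert ops at both ends of the subgraph are fused into the op)
x16 = np.ones((2, 8), dtype=np.float16)
y1 = rms(x16.astype(np.float32)).astype(np.float16)

# Pattern needing "output_type": F32 --> RMS(F32) --> F16
# (there is no leading Convert; only the final Convert is fused)
x32 = np.ones((2, 8), dtype=np.float32)
y2 = rms(x32).astype(np.float16)
```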

github-merge-queue bot pushed a commit that referenced this pull request May 9, 2024
### Details:
 - RMSNorm reference implementation
 - The `Scale` input is optional
 
Note: The conversion of the input/output type is not included (proposed as `computation_type`); the conversion logic can be handled by Convert operations, or the reference can be extended with the conversion logic separately if agreed in the spec.
 
Related PRs:
- Specification Proposal: #23569

### Tickets:
 - 136262
@mitruska (Contributor Author):

Closing - decided to keep RMS as an internal operator for now (moved from GPU custom).
Based on this work, there is a separate PR with documentation of the existing internal::RMS.

@mitruska mitruska closed this May 17, 2024
Labels: category: docs OpenVINO documentation; category: Opset OpenVINO Opset