
[js/common] allows using Uint16Array as data for float16 tensor #23827

Merged · 3 commits merged into main on Mar 3, 2025

Conversation

@fs-eire (Contributor) commented Feb 26, 2025

Description

Resolves #23817

Motivation and Context

@fdwr (Contributor) left a comment

Thanks Yulong.

@fs-eire (Contributor, Author) commented Feb 27, 2025

@Honry @xenova Could you please help check whether this PR fixes the problem? It should work, but I haven't had a chance to test it end-to-end.

Artifacts can be downloaded at https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1629021&view=artifacts&pathAsName=false&type=publishedArtifacts

@Honry (Contributor) commented Feb 27, 2025

@fs-eire, verified, thanks for the quick fix.

@xenova (Contributor) left a comment

Thanks for the fix! For transformers.js, we'll also default to Float16Array if available: #23817
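
For context, a minimal sketch of that "default to Float16Array if available" pattern when building a float16 input tensor (rawBits and dims are hypothetical names, not taken from this PR):

// Assumes `import * as ort from 'onnxruntime-web';` and that rawBits is a
// Uint16Array holding IEEE fp16 bit patterns, with dims as the tensor shape.
const F16 = typeof Float16Array !== 'undefined' ? Float16Array : Uint16Array;
// Both views share the same underlying buffer; after this PR, either type
// is accepted as data for a 'float16' tensor.
const data = new F16(rawBits.buffer, rawBits.byteOffset, rawBits.length);
const inputTensor = new ort.Tensor('float16', data, dims);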

@Honry (Contributor) commented Feb 27, 2025

@fs-eire, hold on, this still breaks demos that read the outputs as Uint16Array. The output is now always stored in a Float16Array when Float16Array is available, but existing demos still treat it as a Uint16Array.

@fdwr (Contributor) commented Feb 27, 2025

> @fs-eire, hold on, this still breaks demos that read the outputs as Uint16Array. The output is now always stored in a Float16Array when Float16Array is available, but existing demos still treat it as a Uint16Array.

Ooh, that's more complicated. How does ORT know what the output type should be? It could look at the input types to deduce the output types (e.g., if there were uint16-as-float16 input tensors, then presumably any output float16 tensors should be uint16 too), but that wouldn't be fully robust (e.g., a model that takes int32 inputs and produces float16-as-uint16 outputs offers no input hint about the output type). Still, it might be a good-enough transient heuristic for now.
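
As a rough illustration of that heuristic (a sketch only, not actual ORT internals; feeds is the usual name-to-ort.Tensor map passed to session.run):

// Returns true if any float16 input was supplied as a Uint16Array, in which
// case float16 outputs would presumably be surfaced as Uint16Array too.
function preferUint16ForFloat16Outputs(feeds) {
  return Object.values(feeds).some(
    (t) => t.type === 'float16' && t.data instanceof Uint16Array
  );
}
// Caveat from above: a model with only int32 inputs and float16 outputs
// gives this heuristic nothing to go on.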

@Honry (Contributor) commented Feb 27, 2025

> How does ORT know what the output type should be? It could look at the input types to deduce the output types […] Still, it might be a good-enough transient heuristic for now.

Exactly, it's difficult to know what the output type should be. Users have to add an API check to determine whether Float16Array should be used.
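
One shape such an API check could take on the application side (the output name 'output' is illustrative, not from this PR):

const hasFloat16Array = typeof Float16Array !== 'undefined';

const results = await session.run(feeds);
const out = results.output;
if (out.type === 'float16') {
  if (hasFloat16Array && out.data instanceof Float16Array) {
    // out.data[i] already yields JavaScript numbers.
  } else {
    // out.data is a Uint16Array of raw fp16 bits; decode before use.
  }
}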

@fs-eire (Contributor, Author) commented Feb 27, 2025

> How does ORT know what the output type should be? […]

> Exactly, it's difficult to know what the output type should be. Users have to add an API check to determine whether Float16Array should be used.

As @fdwr explained, I don't think there is a way to do this inside ORT.

To fix this problem, you need to modify the application code. A simple approach is to always create a Uint16Array view for every float16 output:

if (myTensor.type === 'float16') {
  // Reinterpret the tensor's backing buffer as raw fp16 bits; this works
  // whether ORT returned a Float16Array or a Uint16Array.
  myData = new Uint16Array(myTensor.data.buffer, myTensor.data.byteOffset, myTensor.data.length);
}
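
For code paths that end up with raw fp16 bits in a Uint16Array (e.g. runtimes without Float16Array), values must be decoded manually. A small sketch of such a decoder, not part of the ORT API:

// Converts one IEEE 754 half-precision bit pattern to a JavaScript number.
function float16BitsToNumber(bits) {
  const sign = bits & 0x8000 ? -1 : 1;
  const exp = (bits >> 10) & 0x1f;
  const frac = bits & 0x03ff;
  if (exp === 0) return sign * frac * 2 ** -24;          // zero / subnormal
  if (exp === 0x1f) return frac ? NaN : sign * Infinity; // Inf / NaN
  return sign * (1 + frac / 1024) * 2 ** (exp - 15);     // normal
}

const firstValue = float16BitsToNumber(myData[0]);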

@fs-eire (Contributor, Author) commented Feb 27, 2025

@Honry Please let me know if you are OK with this change or if we need further discussion.

@Honry (Contributor) commented Feb 28, 2025

> @Honry Please let me know if you are OK with this change or if we need further discussion.

Makes sense, and we should remind end users to change their application code if needed.

@fdwr (Contributor) commented Feb 28, 2025

> @Honry Please let me know if you are OK with this change or if we need further discussion.

> Makes sense, and we should remind end users to change their application code if needed.

Sounds worth adding to the breaking changes section: https://github.com/microsoft/webnn-developer-preview?tab=readme-ov-file#breaking-changes

@Honry (Contributor) left a comment

👍

@fs-eire merged commit 1872527 into main on Mar 3, 2025 (93 of 95 checks passed) and deleted the fs-eire/js-support-f16-uint16array branch on March 3, 2025 at 21:49.
Development

Successfully merging this pull request may close these issues:

[Web] Shall we accept Uint16Array for 'float16' if Float16Array is available (#23817)