Add fp16 precision support #222

Open
philloooo opened this issue Apr 17, 2024 · 7 comments

@philloooo

Hi,
Can we add fp16 versions of the models (i.e., have all inputs and weights be fp16)?
On Mac, CoreML only uses the NPU when the data is fp16. It would be valuable to have WebNN samples with fp16 precision to compare performance.
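For concreteness, here is a minimal sketch of what a fully-fp16 WebNN graph could look like (assuming the 2024-era API with `createContext({ deviceType })` and `MLOperandDescriptor.dimensions`; the layer and weight values are placeholders, not any of the actual sample models):

```js
// Request an NPU-backed context; on macOS this is the path that can use the ANE.
const context = await navigator.ml.createContext({ deviceType: 'npu' });
const builder = new MLGraphBuilder(context);

// fp16 input; half-precision tensor data is passed as a Uint16Array of IEEE-754 half bits.
const input = builder.input('input', {
  dataType: 'float16',
  dimensions: [1, 3, 224, 224],
});

// fp16 weights: every constant must also be float16 for the whole graph to stay in half precision.
const weightData = new Uint16Array(64 * 3 * 7 * 7); // placeholder weight bits
const weights = builder.constant(
  { dataType: 'float16', dimensions: [64, 3, 7, 7] },
  weightData,
);

const conv = builder.conv2d(input, weights);
const graph = await builder.build({ output: conv });
```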

@Honry
Collaborator

Honry commented Apr 23, 2024

We plan to add a few fp16 models specifically for image classification.

@huningxin
Contributor

This would also be useful for NPU testing. #220

@Honry
Collaborator

Honry commented May 7, 2024

@philloooo, @mingmingtasd has added 3 fp16 models for image classification: #226

@philloooo
Author

philloooo commented May 13, 2024

Thanks! I've tested it on Mac. Some models are blocked by webmachinelearning/webnn#678, but image classification with ResNet 50 V2 works, and I can confirm it's using the Apple Neural Engine.

I noticed that if I select GPU as the device, the model is different even when the precision is the same. Can we use the same models for GPU and NPU? That would make comparison easier.

@Honry

@mingmingtasd
Contributor

We should allow softmax to be included for both GPU and NPU: https://github.com/webmachinelearning/webnn-samples/pull/226/files#diff-13f8eda69b8dc85839ae4e046882bfbbddd80a5928e8bd3ea6ff7b71e4213f68R124.
The TODO can be resolved, as sketched below.
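A rough sketch of what resolving that TODO might look like (`buildBackbone` is a hypothetical helper standing in for the actual model-building code in #226):

```js
// Build softmax into the graph unconditionally instead of branching on the selected device.
const logits = buildBackbone(builder, input); // hypothetical: last fp16 operand of the model
const output = builder.softmax(logits);       // same graph for 'gpu' and 'npu' contexts
const graph = await builder.build({ output });
```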
@Honry @philloooo

@Honry
Collaborator

Honry commented May 16, 2024

@philloooo, #237 fixes the inconsistent softmax support.

Now all the fp16 models (MobileNetV2, ResNet 50 V1, EfficientNet) use the same model for both GPU and NPU.
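So an apples-to-apples comparison can now look roughly like this (a sketch only: `buildGraph` is a hypothetical helper for the shared fp16 model definition, and it assumes the 2024-era `createContext({ deviceType })` / `context.compute()` API and `'input'`/`'output'` operand names):

```js
for (const deviceType of ['gpu', 'npu']) {
  const context = await navigator.ml.createContext({ deviceType });
  const builder = new MLGraphBuilder(context);
  const graph = await buildGraph(builder); // same fp16 model definition on both devices

  // Placeholder fp16 buffers (IEEE-754 half bits as Uint16Array), recreated per iteration.
  const inputs = { input: new Uint16Array(1 * 3 * 224 * 224) };
  const outputs = { output: new Uint16Array(1000) };

  const start = performance.now();
  await context.compute(graph, inputs, outputs);
  console.log(`${deviceType}: ${(performance.now() - start).toFixed(1)} ms`);
}
```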

@huningxin
Contributor

Can we close this issue?
