Use the same model for GPU and CPU #1292
Unanswered
thewh1teagle asked this question in Q&A
thewh1teagle:
I would like to use Olive to create a single Whisper medium int8 model for the app Vibe. However, it seems that Olive needs separate models for GPU and CPU. I want to simplify that to a single model. Is that possible?
Thanks

Replies: 1 comment
@thewh1teagle Ultimately, an ONNX model runs on ONNX Runtime, so to answer your question: you can optimize a model for CPU and then run inference with it on both GPU and CPU machines. However, Olive optimizes the model for a specific device (NPU/GPU/CPU) from a specific hardware vendor (Nvidia, Qualcomm, etc.), so you will likely get better performance by optimizing for the exact device you want to run on.
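A minimal sketch of that fallback pattern, assuming the onnxruntime Python package; the model path whisper_medium_int8.onnx is a hypothetical Olive output used only for illustration:

```python
import onnxruntime as ort

# Hypothetical path to a model produced by an Olive CPU workflow.
model_path = "whisper_medium_int8.onnx"

# Prefer the CUDA provider when this onnxruntime build and machine
# support it; otherwise fall back to the CPU execution provider.
preferred = ["CUDAExecutionProvider", "CPUExecutionProvider"]
providers = [p for p in preferred if p in ort.get_available_providers()]

session = ort.InferenceSession(model_path, providers=providers)
print("Running with:", session.get_providers())
```

ONNX Runtime selects from the providers list in order, so the same file serves both machine types; the trade-off is the one noted above, since device-specific optimization passes are skipped.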