Quantisation for TTS models #2395
Unanswered
ApoorveK asked this question in General Q&A
Replies: 2 comments
-
I think one way to apply PyTorch's dynamic quantization would be to instantiate the model class. At present, I can't seem to find the model classes for the architectures; it looks like we would have to pull them out of the respective .py files of the models.
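To illustrate the point above: dynamic quantization really does only need an instantiated `nn.Module`. Below is a minimal sketch with a hypothetical `ToyDecoder` standing in for a TTS sub-module (the real model classes live in the per-model .py files mentioned above); `torch.ao.quantization.quantize_dynamic` converts the weights of the listed layer types to int8 while activations stay in float.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for a TTS decoder; not a real class from this repo.
class ToyDecoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(80, 256)
        self.lstm = nn.LSTM(256, 256, batch_first=True)
        self.out = nn.Linear(256, 80)

    def forward(self, x):
        x = self.proj(x)
        x, _ = self.lstm(x)
        return self.out(x)

model = ToyDecoder().eval()

# Replace Linear/LSTM weights with int8 versions; no calibration data needed.
qmodel = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear, nn.LSTM}, dtype=torch.qint8
)

# Quantized model accepts the same float inputs and keeps the output shape.
mel = torch.randn(1, 50, 80)
with torch.no_grad():
    out = qmodel(mel)
```

This covers the Linear/LSTM-heavy parts of a model; custom layers are untouched by `quantize_dynamic`, which is where the fusion question below comes in.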
-
I've been trying to quantize the XTTS models. I've gotten them to quantize, but inference seems to be broken for all of them :/ Any work related to this will be located here:
-
So, the main idea behind this discussion is finding an efficient way to quantize TTS models for faster inference. Since the layers being used are custom layers (for example, in the VITS model), how can we quantize these layers, and which framework would be ideal for that? I have been through the PyTorch documentation and cannot find anything about fusing custom layers, which is an important step in quantizing TTS models (whether via quantization-aware training or post-training quantization).
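One workaround for the fusion question: PyTorch's `fuse_modules` only knows built-in patterns such as Conv+BN+ReLU, so instead of fusing the custom block as a whole, you can fuse the standard sub-layers the custom block is composed of, addressing them by qualified name. A minimal sketch, assuming a hypothetical `ResidualBlock` loosely shaped like a VITS-style custom layer:

```python
import torch
import torch.nn as nn

# Hypothetical custom layer, not taken from the VITS source.
class ResidualBlock(nn.Module):
    def __init__(self, ch=64):
        super().__init__()
        self.conv = nn.Conv1d(ch, ch, 3, padding=1)
        self.bn = nn.BatchNorm1d(ch)
        self.relu = nn.ReLU()

    def forward(self, x):
        return x + self.relu(self.bn(self.conv(x)))

block = ResidualBlock().eval()  # fusion for inference requires eval mode

# Fuse the built-in conv/bn/relu chain inside the custom block; the BN is
# folded into the conv weights and replaced by an Identity.
fused = torch.ao.quantization.fuse_modules(block, [["conv", "bn", "relu"]])

# Outputs should match the unfused block up to small float error.
x = torch.randn(1, 64, 100)
with torch.no_grad():
    assert torch.allclose(block(x), fused(x), atol=1e-4)
```

Truly custom ops (e.g. flow layers) have no fusion pattern at all, so for PTQ they typically have to be left in float while the surrounding conv/linear layers are quantized.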