Trained TTS VitsModel Restricted to 11 Seconds of Audio Generation #6972
-
By following this tutorial, I trained a TTS VitsModel. Is there a workaround available to generate the complete audio for longer texts? Note:
-
@VahidooX could you kindly review my inquiry? Thank you.
-
Thanks. @treacker, could you please have a look?
-
I found the same issue reported here as well: #6998
-
Sorry for the late response. The problem is in the `max_len` parameter, which is set to 1000 by default and can't be changed from the `convert_text_to_waveform` function. The workaround is either to use the `forward` function, which is called inside `convert_text_to_waveform`, or to add an option to change `max_len`. @XuesongYang, please choose which solution is preferred.
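For anyone wondering why the cutoff lands at roughly 11 seconds, here is a rough back-of-the-envelope sketch. It assumes that `max_len` counts decoder frames and that the model was trained with a hop length of 256 samples at a 22050 Hz sample rate. Those two values are common VITS defaults, not something stated in this thread, so check your own config. The helper names are made up for illustration and are not part of NeMo.

```python
import math


def max_audio_seconds(max_len: int, hop_length: int, sample_rate: int) -> float:
    """Longest audio (in seconds) that `max_len` frames can cover."""
    return max_len * hop_length / sample_rate


def frames_needed(seconds: float, hop_length: int, sample_rate: int) -> int:
    """Smallest frame count that covers `seconds` of audio."""
    return math.ceil(seconds * sample_rate / hop_length)


MAX_LEN = 1000        # default mentioned in the reply above
HOP_LENGTH = 256      # ASSUMPTION: typical VITS hop length, check your config
SAMPLE_RATE = 22050   # ASSUMPTION: typical VITS sample rate, check your config

limit = max_audio_seconds(MAX_LEN, HOP_LENGTH, SAMPLE_RATE)
print(f"max_len={MAX_LEN} covers about {limit:.2f} s of audio")

# Frame budget that a 30-second utterance would need under the same assumptions.
print(frames_needed(30.0, HOP_LENGTH, SAMPLE_RATE))
```

Under these assumptions the limit works out to about 11.6 seconds, which would be consistent with the truncation described in the title. If your hop length or sample rate differs, the cutoff moves accordingly.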