Why throw away the Phase spectrum? #1129
VigneshBaskar
started this conversation in
General
Replies: 1 comment 3 replies
-
I remember reading a couple of papers suggesting that, but it's been a while. I think it's worth trying, especially with one of the normalizing flow models like glow tts as they produce finer outputs. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
The output of a TTS system is some form of Magnitude Spectrum (Mel Spectrogram or something else). The Phase spectrum is not at all taken into account for training the TTS system. I understand that both the magnitude spectrum and phase spectrum can be calculated from the Fourier coefficients as follows:
So ideally just one of these spectra is sufficient as it already contains all the required Fourier coefficients. But still...
May I request someone to help me with the following questions please:
Beta Was this translation helpful? Give feedback.
All reactions