Replies: 1 comment
-
Hi! Hope this helps! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello, I am very unfamiliar with the way the overflow model works, but I have seen that in it's dict of outputs it gives an alignments category. Now, I need a phoneme level aligner for my project and as far as I have seen it would be marvelous at the task because it learns some extremely good alignments, especially with phonemes. Is there a way to condition the alignments on the dataset's spectrogram so that it is possible to use a trained model as a forced aligner? If not, I have heard that for giving alignments to glow tts and such models, there is an aligner network that is jointly trained with those models. Is there a way to get dataset alignments out of that?
Thanks, have a good day!
Beta Was this translation helpful? Give feedback.
All reactions