possibility for overflow model to generate character level alignments to the training dataset #2252

albluc24 · 2023-01-02T10:13:23Z

albluc24
Jan 2, 2023

Hello, I am very unfamiliar with the way the overflow model works, but I have seen that in it's dict of outputs it gives an alignments category. Now, I need a phoneme level aligner for my project and as far as I have seen it would be marvelous at the task because it learns some extremely good alignments, especially with phonemes. Is there a way to condition the alignments on the dataset's spectrogram so that it is possible to use a trained model as a forced aligner? If not, I have heard that for giving alignments to glow tts and such models, there is an aligner network that is jointly trained with those models. Is there a way to get dataset alignments out of that?
Thanks, have a good day!

shivammehta25 · 2023-01-17T13:24:19Z

shivammehta25
Jan 17, 2023
Collaborator

Hi!
Since it is a left-to-right no skip, I would say it is better to have phoneme level alignments over character level alignments (which seems to be your case. So once the model is trained you can run a forward pass of the model and you can get alignments in output['alignment'].

Hope this helps!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

possibility for overflow model to generate character level alignments to the training dataset #2252

{{title}}

Replies: 1 comment

{{title}}

Select a reply

possibility for overflow model to generate character level alignments to the training dataset #2252

albluc24 Jan 2, 2023

Replies: 1 comment

shivammehta25 Jan 17, 2023 Collaborator

albluc24
Jan 2, 2023

shivammehta25
Jan 17, 2023
Collaborator