-
Notifications
You must be signed in to change notification settings - Fork 218
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Phonetic recognition #45
Comments
I created a feature branch (
Note that this feature branch may change at any time. |
Thank you very much! |
You should try something like montreal-forced-aligner, it supports many languages out of box |
@nshmyrev Thanks for the reference; that project looks interesting. However, this issue is about phonetic recognition, while Montreal Forced Aligner is about forced alignment and G2P. Am I missing a connection? |
I've extracted all the speech recognition logic into an interface called |
Rhubarb Lip Sync uses word-based speech recognition. That works well for English dialog. For non-English dialog, however, phonetic recognition might work better. Rather than try to extract English words from non-English speech, this will extract phonemes.
I'm planning to add a CLI option to switch to phonetic recognition.
This is only a temporary solution. In the long run, I still plan to implement full (word-based) recognition for languages other than English (see #5).
The text was updated successfully, but these errors were encountered: