New Tokenizer API #937
I'll take a look this weekend, @erogol 👍 |
Discard OOV chars but store them, and show a warning message the first time each OOV char is encountered.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels. |
Still relevant. |
This is already merged after rebasing |
Tokenizer API
The Tokenizer API is defined by the TTSTokenizer class. It is intended to provide all the text processing functionality to a tts model. New tokenizers can also be added by subclassing the TTSTokenizer class.
Phonemizer API
The Phonemizer API is defined by the BasePhonemizer class and implemented by the ESpeak and Gruut wrappers and by the ZH_CH and JP_JA phonemizers. New phonemizers can be added by implementing the BasePhonemizer class.
BaseCharacters
The BaseCharacters class provides an API to define the model vocabulary and the dictionary that maps characters to token IDs and back. There are two pre-defined classes inheriting from BaseCharacters: IPAPhonemes, which defines the IPA phoneme character set for models using phonemes, and Graphemes, which defines the grapheme set for models using raw characters.
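To illustrate the character-to-ID mapping described above, together with the OOV behaviour requested in the conversation, here is a minimal sketch. `ToyCharacters`, its constructor arguments, and the `encode`/`decode` method names are hypothetical stand-ins for illustration only; they are not the actual BaseCharacters interface from this PR.

```python
import warnings


class ToyCharacters:
    """Hypothetical vocabulary class; NOT the BaseCharacters class from this PR."""

    def __init__(self, characters, pad="<PAD>"):
        # ID 0 is reserved for padding; the remaining IDs follow the given order.
        self.vocab = [pad] + list(characters)
        self.char_to_id = {ch: idx for idx, ch in enumerate(self.vocab)}
        self.id_to_char = {idx: ch for idx, ch in enumerate(self.vocab)}
        # OOV characters seen so far, stored so each one is reported only once.
        self.not_found = set()

    def encode(self, text):
        """Map characters to token IDs, discarding OOV characters."""
        ids = []
        for ch in text:
            if ch in self.char_to_id:
                ids.append(self.char_to_id[ch])
            elif ch not in self.not_found:
                # First time this OOV character is seen: store it and warn.
                self.not_found.add(ch)
                warnings.warn(f"Character {ch!r} not found in the vocabulary. Discarding it.")
        return ids

    def decode(self, ids):
        """Map token IDs back to a string."""
        return "".join(self.id_to_char[idx] for idx in ids)


chars = ToyCharacters("abc ")
token_ids = chars.encode("a cab")
round_trip = chars.decode(token_ids)
```

With this toy vocabulary, encoding and then decoding a string made only of known characters returns the original string, while unknown characters are dropped and collected in `not_found`.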
Punctuations class
The Punctuations class strips punctuation out of the text and restores it when needed.
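The strip-and-restore idea can be sketched as follows. This is only an illustration under stated assumptions: the `PUNCS` set and the `strip_puncs` and `restore_puncs` helpers are hypothetical names, and the sketch does not reproduce the interface of the Punctuations class in this PR.

```python
import re

# Assumed punctuation set, for illustration only.
PUNCS = "!,.?;:"
# One capturing group, so re.split keeps the punctuation runs in its output.
_PUNC_RE = re.compile(rf"(\s*[{re.escape(PUNCS)}]+\s*)")


def strip_puncs(text):
    """Split text into punctuation-free segments and the punctuation between them."""
    parts = _PUNC_RE.split(text)
    # With a single capturing group, segments sit at even indices
    # and the matched punctuation runs at odd indices.
    segments = parts[0::2]
    puncs = parts[1::2]
    return segments, puncs


def restore_puncs(segments, puncs):
    """Interleave the segments with the stored punctuation runs."""
    out = []
    for idx, segment in enumerate(segments):
        out.append(segment)
        if idx < len(puncs):
            out.append(puncs[idx])
    return "".join(out)


segments, puncs = strip_puncs("Hello, world!")
restored = restore_puncs(segments, puncs)
```

The punctuation-free segments could be processed independently (for example, phonemized) before the stored punctuation is put back in the same positions.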
Language specific text normalization routines under TTS.tts.utils.text
Under TTS.tts.utils.text there are folders for each language to accommodate the text normalization routines that are designed for the language.
GlowTTS recipe and model using the new API
Other models are not compatible with the new API currently.
TODO: