-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Identifying Spoken Language #4903
Comments
@fayejf is the model published? Please point to the docs. |
It looks like there is a labeller, see https://github.com/NVIDIA/NeMo/blob/main/examples/asr/speech_classification/speech_to_label.py#L81 |
@jnnnnn @Sasha-Bachynskyi The model is published. Thanks for your patience. #5080 |
Hi, @fayejf! I can't figure out how to use this model. There is only an instance of how to initialize a model. Thank you in advance for helping! |
Hi @Sasha-Bachynskyi , PR to merge info regarding docs should be merged soon. You may infer the label using For inferencing on single audio file use |
Hi @nithinraok, I'm sorry for bothering you. I use the following instruction Below is my code:
But, I get an error:
It seems that there is something wrong with librosa System info: What can it be? I'd appreciate any help in advance |
Looks like librosa is expecting mandatory naming args from newest version. Lower your librosa version or use the fix provided at #6086 |
Hello, developers.
Is there a model or something to identify spoken language? For example, how to identify whether a speaker speaks English or Russian.
I looked for it in the tutorials and found nothing.
I will appreciate any help
The text was updated successfully, but these errors were encountered: