Unable to get NeMo working with canary-1b #8389
-
Hi all, I'm hoping for some pointers on an issue I'm running into while trying to get the canary model running locally on some radio recordings I want to transcribe. When using {"audio_filepath": "/home/csmith/projects/simple-tr-transcription/temp_1399-1707503502_852412500.1-call_11866.wav", "duration": 20.48, "taskname": "asr", "source_lang": "en", "target_lang": "en", "pnc": "yes"} I get the following output and no transcription:
Please let me know if there's any other information that would be helpful in diagnosing what I'm missing - would really love to give this model a shot!
-
I got it working by adding an |
-
Hi @Kurumindla-Kranthivardhan, we currently only support single-channel audio input, and your input may have more than one channel. We will add support for multi-channel input. @pzelasko
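Until multi-channel input is supported, one workaround is to downmix the recording to mono before passing it to the model. The snippet below is only a rough sketch using the Python standard library: it assumes the file is uncompressed 16-bit PCM WAV, and `downmix_to_mono` is a hypothetical helper name, not something provided by NeMo.

```python
import array
import sys
import wave


def downmix_to_mono(src_path, dst_path):
    """Average all channels of a 16-bit PCM WAV file into one channel.

    Returns the number of channels found in the source file.
    Hypothetical helper for illustration; not part of NeMo.
    """
    with wave.open(src_path, "rb") as src:
        n_channels = src.getnchannels()
        sample_width = src.getsampwidth()
        frame_rate = src.getframerate()
        frames = src.readframes(src.getnframes())

    if sample_width != 2:
        raise ValueError("this sketch only handles 16-bit PCM audio")

    # WAV samples are little-endian signed 16-bit integers.
    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()

    if n_channels == 1:
        mono = samples
    else:
        # Samples are interleaved per frame: ch0, ch1, ..., ch0, ch1, ...
        mono = array.array(
            "h",
            (
                sum(samples[i:i + n_channels]) // n_channels
                for i in range(0, len(samples), n_channels)
            ),
        )

    if sys.byteorder == "big":
        mono.byteswap()

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(frame_rate)
        dst.writeframes(mono.tobytes())

    return n_channels
```

The return value also works as a quick check: if it is 1, the file was already mono and the channel count is not the cause of the problem.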
Sorry about the confusion. In the current NeMo design, the ground-truth field ("answer") is required even for inference, so it has to be set explicitly to a dummy string (any value works). Later we will make that field optional so that users won't need to set a dummy ground truth manually.
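As a rough illustration of that workaround, the manifest entry from the original question could be written with an added dummy "answer" field. This is a sketch under assumptions: the placeholder value `"na"` is arbitrary, `make_manifest_entry` and `write_manifest` are hypothetical helper names, and the manifest is assumed to be a text file with one JSON object per line.

```python
import json


def make_manifest_entry(audio_filepath, duration, dummy_answer="na"):
    """Build one manifest entry with a placeholder ground-truth field.

    The field names mirror the entry in the original question; "answer"
    is the extra field described above. Its value is a dummy string.
    """
    return {
        "audio_filepath": audio_filepath,
        "duration": duration,
        "taskname": "asr",
        "source_lang": "en",
        "target_lang": "en",
        "pnc": "yes",
        "answer": dummy_answer,  # dummy ground truth, any string
    }


def write_manifest(entries, manifest_path):
    """Write the entries as one JSON object per line."""
    with open(manifest_path, "w", encoding="utf-8") as f:
        for entry in entries:
            f.write(json.dumps(entry) + "\n")
```

For the recording in the question, this would be called as `make_manifest_entry("/home/csmith/projects/simple-tr-transcription/temp_1399-1707503502_852412500.1-call_11866.wav", 20.48)` and the result passed to `write_manifest` in a list.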