Cache aware streaming ASR - discrepancies between training and inference #7010
-
Hello,

Firstly, thanks for putting together a great tool.

I've trained a cache-aware streaming FastConformer model using the standard config and the standard CTC ASR training script. The model trains well, but when I evaluate streaming performance using the cache-aware streaming inference script, there is a drastic difference between offline and streaming performance (13% and 70% respectively). This surprised me, because I thought that I had limited the attention context using

I've confirmed that the default streaming configs are the same between training and inference (the standard parameters returned by

Am I missing something? I would have thought that

Any guidance would be much appreciated.

Many thanks
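To make the "limited attention context" idea in the question concrete, here is a minimal sketch of a limited-context attention mask. It is plain Python and not NeMo code; the function name `limited_context_mask` and the `left`/`right` parameters are hypothetical names chosen for illustration only.

```python
def limited_context_mask(num_frames, left, right):
    """Return mask[i][j] == True when frame i is allowed to attend to frame j.

    `left` and `right` are the number of past and future frames visible
    to each frame (illustrative parameters, not a NeMo config).
    """
    return [
        [i - left <= j <= i + right for j in range(num_frames)]
        for i in range(num_frames)
    ]


# Each frame sees itself plus at most two past frames and no future frames.
mask = limited_context_mask(num_frames=6, left=2, right=0)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

With a mask like this applied during training, each output frame depends only on a bounded window of input, which is why one would expect offline and streaming decoding to produce closely matching results.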
Replies: 5 comments 2 replies
-
Which branch or NeMo version are you using?
-
Thanks for your response. I'm using the master branch on this commit:
-
I found a bug in updating the caches and fixed it in this PR: #7034. Would you please try it out?
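The thread does not describe the actual bug, so the following is only a toy illustration of why cache updates matter in chunked streaming, not a reproduction of the fix in the PR. It uses a stand-in "layer" that averages each frame with up to `left` preceding frames; all function names and the `update_cache` flag are hypothetical.

```python
def offline(frames, left):
    """Process the whole sequence at once with limited left context."""
    out = []
    for i in range(len(frames)):
        window = frames[max(0, i - left): i + 1]
        out.append(sum(window) / len(window))
    return out


def streaming(frames, left, chunk_size, update_cache=True):
    """Process the sequence chunk by chunk, carrying past frames in a cache."""
    cache = []  # the last `left` frames seen so far
    out = []
    for start in range(0, len(frames), chunk_size):
        chunk = frames[start:start + chunk_size]
        padded = cache + chunk  # prepend cached history to the new chunk
        offset = len(cache)
        for k in range(len(chunk)):
            i = offset + k
            window = padded[max(0, i - left): i + 1]
            out.append(sum(window) / len(window))
        if update_cache:
            # Keep only the most recent `left` frames for the next chunk.
            cache = padded[-left:] if left > 0 else []
    return out


frames = [1.0, 2.0, 4.0, 8.0, 16.0, 32.0, 64.0]
correct = streaming(frames, left=2, chunk_size=3)
stale = streaming(frames, left=2, chunk_size=3, update_cache=False)
print(correct == offline(frames, left=2))  # cache updated: matches offline
print(stale == offline(frames, left=2))    # cache never updated: diverges
```

When the cache is refreshed after every chunk, the streaming output is identical to the offline output; when it is left stale, the first frames of every chunk lose their history and the results drift apart, which is the kind of offline/streaming gap reported above.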
-
Sorry for the delay. That worked perfectly! Thanks very much.
-
Hello @titu1994, I am training this model for German, and I obtained these training stats after 100 epochs. Looking at the training loss and training WER, I believe the losses have not yet converged. Do you think something is wrong with the parameters or hyperparameters, or does the training need to continue? Thank you
I found a bug in updating the caches and fixed it in this PR:
#7034
Would you please try it out?