-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding RNN encoder for LSTM-Transducer and LSTM-CTC models #3886
Conversation
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
This pull request introduces 3 alerts when merging a189aa0 into e188f36 - view on LGTM.com new alerts:
|
Signed-off-by: Vahid <vnoroozi@nvidia.com>
This pull request introduces 2 alerts when merging e9bb886 into e188f36 - view on LGTM.com new alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall looks great, minor comments here and there.
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
This pull request fixes 1 alert when merging d1355a0 into 84236ba - view on LGTM.com fixed alerts:
|
Signed-off-by: Vahid <vnoroozi@nvidia.com>
This pull request fixes 1 alert when merging 6649533 into ca8a7e0 - view on LGTM.com fixed alerts:
|
This pull request fixes 1 alert when merging 253d441 into b1b6e5e - view on LGTM.com fixed alerts:
|
Signed-off-by: Vahid <vnoroozi@nvidia.com>
This pull request fixes 5 alerts when merging 7ff3717 into 5c88c8d - view on LGTM.com fixed alerts:
|
This pull request fixes 5 alerts when merging 20d81ec into 262fe05 - view on LGTM.com fixed alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall looks good, just reuse model_defaults.pred_hidden
instead of add yet another variable proj_size
at model_defaults level.
Things in model_defaults are :
- Global to a model archetype (not just 1 model like LSTMs here)
- Referred to multiple places in the model config (so user can modify the global one and propagate to rest of config).
proj_size is effectively pred_hidden here so I don't see why should split them into two.
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great ! Merge when ready
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
What does this PR do ?
This PR adds RNN-based encoder to NeMo. It enables us to have LSTM-Transducer (RNN-T) and LSTM-CTC models.
Changelog
Usage
# Add a code snippet demonstrating how to use this
PR Type: