-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Minor typo fixes in TTS tutorial #5266
Conversation
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
I have no idea why it moved a bunch of metadata around. ._. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the fixes. I wonder if there is an easy way to run a spellchecker on a notebook.
I think it has to do with pycharm configuration. There is some metadata at the very bottom related to notebook format version. |
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Signed-off-by: 1-800-bad-code <shane.carroll@utsa.edu>
* remove stage wrapper from parallel Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * fixed the onnx bug in conformer for non-streaming models. (#5242) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> * [Tools][ASR] Tool for generating data using simulated RIRs (#5158) [Tools][ASR] Tool for generating data using simulated RIRs Signed-off-by: Ante Jukić <ajukic@nvidia.com> * Add fully torch.jit.script-able speaker clustering module (#5191) * Add files for commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Added parallelism on p-value search Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed speaker clustering to accept torch.tensor Signed-off-by: Taejin Park <tango4j@gmail.com> * Cleaned up the code and tested to have identical output Signed-off-by: Taejin Park <tango4j@gmail.com> * update on Notebook demo Signed-off-by: Taejin Park <tango4j@gmail.com> * Added eigvalsh for faster eig val calculation: Signed-off-by: Taejin Park <tango4j@gmail.com> * Remove NMESC_JitScriptedModule.ipynb Signed-off-by: Taejin Park <tango4j@gmail.com> * Cleaned code and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Modified MSDD framework to fit torch-scripted clustering Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM fix Signed-off-by: Taejin Park <tango4j@gmail.com> * removed all string based timestamps Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary lines Signed-off-by: Taejin Park <tango4j@gmail.com> * removed redundant lines Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Update perturb.py (#5231) * Update perturb.py Add checking for channels mismatch for audio and noise data, throw an exception if they have different number of channels. Also fixed `perturb_with_foreground_noise` as done in `perturb_with_input_noise` Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update check and teest Signed-off-by: stevehuang52 <heh@nvidia.com> * fix test Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * remove CV requirements. (#5233) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * Fix link to inference notebook (#5247) (#5251) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Co-authored-by: Jocelyn <jocelynh@nvidia.com> * checks for accepted adapter type at module level (#5194) * add accepted adapter functionality into transformer, mlp and attention Signed-off-by: arendu <adithya.r@gmail.com> * fix to t5 adapter and ia3 evals due to predict_step dictionary key changes Signed-off-by: arendu <adithya.r@gmail.com> * use mixin logic for adapters in ParallelAttention and ParallelMLP classes Signed-off-by: arendu <adithya.r@gmail.com> * typo fix Signed-off-by: arendu <adithya.r@gmail.com> * updates Signed-off-by: arendu <adithya.r@gmail.com> * moved adapter tools Signed-off-by: arendu <adithya.r@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix error with t5 adapter Signed-off-by: arendu <adithya.r@gmail.com> * updates' Signed-off-by: arendu <adithya.r@gmail.com> * replace ColumnParallelLinear with nn.Linear in export_utils Signed-off-by: arendu <adithya.r@gmail.com> * remove ColumnLinear Signed-off-by: arendu <adithya.r@gmail.com> * typo fix Signed-off-by: arendu <adithya.r@gmail.com> * update to check config targets Signed-off-by: arendu <adithya.r@gmail.com> * updates Signed-off-by: arendu <adithya.r@gmail.com> * refactor so that mixin is adapter name agnostic Signed-off-by: arendu <adithya.r@gmail.com> * fix merge conflict Signed-off-by: arendu <adithya.r@gmail.com> * minor Signed-off-by: arendu <adithya.r@gmail.com> * minor Signed-off-by: arendu <adithya.r@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * using class comparison instead of string match Signed-off-by: arendu <adithya.r@gmail.com> * fix test fail Signed-off-by: arendu <adithya.r@gmail.com> * fixed checks for add_adapter Signed-off-by: arendu <adithya.r@gmail.com> * fixed checks for add_adapter Signed-off-by: arendu <adithya.r@gmail.com> Signed-off-by: arendu <adithya.r@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * fix groovy syntax Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * fix hypotheses return (#5253) Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Update ASR scores table (#5254) (#5255) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> * Support for inserting additional subsampling in conformer encoder (#5224) * Change the default position of the reduction position to null and rename subsampling reduction to striding Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * Put the caching logic outside the conformer encoder Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add description of the reduction parameters in the configs Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test_asr_exportables with correct reduction position value Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * asr and nmt tests in parallel Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * add more users who can trigger blossom-ci Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * path fix Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * fix paths, remove redundant test Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * add symlink Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Modernize RNNT ONNX export and add TS export (#5248) * Upgrade rnnt export for CUDA/CPU/TRT Signed-off-by: smajumdar <titu1994@gmail.com> * Update runtime script for onnx exported model to modern API Signed-off-by: smajumdar <titu1994@gmail.com> * Finalize code Signed-off-by: smajumdar <titu1994@gmail.com> * Remove comments Signed-off-by: smajumdar <titu1994@gmail.com> * Remove redundant stuff from tests Signed-off-by: smajumdar <titu1994@gmail.com> * Update test Signed-off-by: smajumdar <titu1994@gmail.com> * Remove onnx rnnt export test due to lack of onnxruntime install Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * update tutorials to use meeting config as default and VAD (#5237) * update tutorials to use meeting config as default and VAD Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * update model path Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Fix links to speaker identification notebook (#5260) (#5261) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> * add shm-size Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * [TTS] Fastpitch energy condition and refactoring (#5218) * Incorporating Energy conditioning in FastPitch Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Minor fixes in Energy conditioning in FastPitch Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Add Energy conditioning in FastPitch to infer method Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * adding fn to function names Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Incorporating Energy conditioning in FastPitch Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Minor fixes in Energy conditioning in FastPitch Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Add Energy conditioning in FastPitch to infer method Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * adding fn to function names Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * remove ifelse from batching, minor refactoring changes in energy code Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Refactor based on PR comments. Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Added support for not learning alignment in energy Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Fix typo in assert statemetn Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Renaming average_pitch to average_features Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Renaming len variable name as it is a keyword Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> * Renaming len variable name as it is a keyword Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * [TTS] HiFi-TTS Download Script (#5241) * Hifi tts download script Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu> * comment and remove imports Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu> Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Specifying audio signal dropout separately for the Conformer Encoder (#5263) * Fixed bug in transcribe_speech.py where decoding strategy was not being updated. Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add option to specify audio dropout separately for conformer encoders Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * Add audio dropout option to test_asr_exportables Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * Rename dropout_audio to dropout_pre_encode Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * Update the comments in squeezeformer configs referring to conformer modules Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * created (#5268) * created * bug Signed-off-by: Dima Rekesh <drekesh@nvidia.com> Signed-off-by: Dima Rekesh <drekesh@nvidia.com> Co-authored-by: Dima Rekesh <drekesh@nvidia.com> * [TTS] Add Mandarin/English Bilingual Recipe for Training Fastpitch Models (#5208) * Add Chinese TTS tokenizer and G2P. * Add data process script. * Add tutorial. Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com> * Minor typo fixes in TTS tutorial (#5266) (#5272) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Co-authored-by: Jocelyn <jocelynh@nvidia.com> * Fix failing speaker counting for short audio samples (#5267) * Add files for commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Added parallelism on p-value search Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed speaker clustering to accept torch.tensor Signed-off-by: Taejin Park <tango4j@gmail.com> * Cleaned up the code and tested to have identical output Signed-off-by: Taejin Park <tango4j@gmail.com> * update on Notebook demo Signed-off-by: Taejin Park <tango4j@gmail.com> * Added eigvalsh for faster eig val calculation: Signed-off-by: Taejin Park <tango4j@gmail.com> * Remove NMESC_JitScriptedModule.ipynb Signed-off-by: Taejin Park <tango4j@gmail.com> * Cleaned code and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Modified MSDD framework to fit torch-scripted clustering Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM fix Signed-off-by: Taejin Park <tango4j@gmail.com> * removed all string based timestamps Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary lines Signed-off-by: Taejin Park <tango4j@gmail.com> * removed redundant lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Add enhanced speaker count back Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed minor docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * removed import Counter Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5271) (#5273) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix bug into Dialogue tutorial (#5277) (#5280) Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * [TTS] fixed type of filepath and rename openslr. (#5276) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * O2bert + apex pipeline functions (#5221) * Global batch size support for validation * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Global batch size support for bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bert batch support * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bert batch size support * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * O2 support for bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_pretraining.py Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_config.yaml Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Bug fix * Bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Bug fix * Bug fix * Bug fix * Update megatron_bert_config.yaml Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Addressed Sandeeps comments Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Update Jenkinsfile Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Update Jenkinsfile Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Update Jenkinsfile Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Typo fix (#5288) (#5291) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> * Upperbound PTL (#5302) * Upperbound PTL Signed-off-by: smajumdar <titu1994@gmail.com> * Upperbound PTL Signed-off-by: smajumdar <titu1994@gmail.com> * Upperbound PTL Signed-off-by: smajumdar <titu1994@gmail.com> * Upperbound PTL Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Add support for Sampled Softmax for RNNT Joint (#5216) * Initial prototype of SampldRNNTJoint Signed-off-by: smajumdar <titu1994@gmail.com> * Implement randperm based noise selection algo Signed-off-by: smajumdar <titu1994@gmail.com> * First working prototype of sampled rnnt ! Signed-off-by: smajumdar <titu1994@gmail.com> * Add note for why we need this remap before i forget Signed-off-by: smajumdar <titu1994@gmail.com> * Finalize version that works with sampling Signed-off-by: smajumdar <titu1994@gmail.com> * Update docs for rnnt decoder and joint Signed-off-by: smajumdar <titu1994@gmail.com> * Remove the adjustment_val for softmax Signed-off-by: smajumdar <titu1994@gmail.com> * Update config and docs for Sampled Softmax Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Update Interface(s) phonetic entry (#5212) * change interface(s) phone Signed-off-by: Jason <jasoli@nvidia.com> * push version Signed-off-by: Jason <jasoli@nvidia.com> * update dict path Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * [TTS] remove obsolete torch_tts unit test marker and replace with run_only_on('CPU') (#5307) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * Fixes for Conformer-xl export (#5309) * Fixing runtime check for ONNX > 2G Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Fixing ONNX export Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add label inference support to EncDecSpeakerLabel class (#5278) * add label inference support to EncDecSpeakerLabel class Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * add necessary tests Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * reflect on comments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * grammatical correction Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * minor doc string changes Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * [TTS] bugfix IPAG2P and refactor to remove duplicate process. (#5304) * [TTS] bugfix IPAG2P and refactor to remove duplicate process. * added type hints and rename func. * unify str and list(str) as list(str). * revise logging message when phoneme_dict_obj is empty Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * Update path to get_data.py in TTS tutorial (#5311) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add italian model checkpoints (#5315) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * Text Memmap Parsing Improvements (#5265) * 1. Fixed text-memmap issue when boundary (new-line) is missing from end of file). Signed-off-by: Micha Livne <mlivne@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * 1. Fixed style. Signed-off-by: Micha Livne <mlivne@nvidia.com> * 1. Added support in paratial sample loading and alternative decoding. Signed-off-by: Micha Livne <mlivne@nvidia.com> * 1. Fixed syntax issues. Signed-off-by: Micha Livne <mlivne@nvidia.com> * 1. Minor change. Signed-off-by: Micha Livne <mlivne@nvidia.com> * 1. Extended flexibility of mapping indices. Signed-off-by: Micha Livne <mlivne@nvidia.com> * 1. Added validation ofdtype of indexing function. Signed-off-by: Micha Livne <mlivne@nvidia.com> Signed-off-by: Micha Livne <mlivne@nvidia.com> Co-authored-by: Micha Livne <mlivne@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * [TTS] Replace IPA lambda arguments with locale string (#5298) * [TTS] Replace IPA lambda arguments with locale string * [TTS] Add locale validation * Fixed typos * Return punctuation as sorted list Signed-off-by: Ryan <rlangman@nvidia.com> * Remove onnx graphsurgery from Dockerfile (#5320) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Update refspec (#5321) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Force wav file format for audio_filepath (#5323) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Updates to T0 Dataset and Model (#5201) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor T0 dataset Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove bin compat Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix header lines Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * [DOC] add sphinx-copybutton requirement to copy button on code snippets. (#5326) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * [TTS] expand to support flexible dictionary entry formats in IPAG2P. (#5318) * expand to support flexible dictionary entry formats in IPAG2P. * removed unused imports in test.collections.tts * removed unused imports in nemo.collections.tts.modules * removed unused imports in nemo_text_processing.text_normalization.zh * updated unit tests with new cases * renamed test function names because we only test IPAG2P rather than all classes in the modules.py. * revise current test dict with a single space between word and pronunications. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * small bugfix for r1.13.0 (#5310) (#5325) * typo fix * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> * Option to pad the last validation input sequence if its smaller than the encoder sequence length for MegatronGPT (#5243) * Option to pad the last input sequence of validation dataset if its smaller than the encoder sequence length for MegatronGPT * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added default value for drop last argument Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Anmol Gupta <anmolg@nvidia.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Add support for Hydra multirun to NeMo (#5159) * Update execution doc and remove old snippet Signed-off-by: smajumdar <titu1994@gmail.com> * Fix types Signed-off-by: smajumdar <titu1994@gmail.com> * Fix defaults Signed-off-by: smajumdar <titu1994@gmail.com> * Fix types for ParallelAdapterConfig Signed-off-by: smajumdar <titu1994@gmail.com> * Add hash for config cache Signed-off-by: smajumdar <titu1994@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add support to delete redundant ckpt files for HP search Signed-off-by: smajumdar <titu1994@gmail.com> * Correct config for IA3 Signed-off-by: smajumdar <titu1994@gmail.com> * Fix check to <= 0 Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * typo fix (#5328) * Speed up HF data processing script for ASR (#5330) * Correct hydra issue with relative filepaths Signed-off-by: smajumdar <titu1994@gmail.com> * Improve speed of dataset processing Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * add precommit hood to automatic sort entries in requirements. (#5333) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * [TTS] update organization of model checkpoints and their pointers. (#5327) * [TTS] update orgnization of model checkpoints and their pointers. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * move model name column to the 2nd col and correct model names as predefined_model_name. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * Add speaker clustering arguments to forward function (#5306) * Move arguments to forward function Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Resolved type issue Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) (#5341) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * [TTS] bugfix for the script of generating mels. (#5344) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * Fixing de-autocast (#5319) * Fixing de-autocast Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleanup Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Refining export with max_dim/batch Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Moving cast utils to its own module Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Pcla tutorial fixes (#5313) (#5347) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> * bug (#5348) Co-authored-by: Dima Rekesh <drekesh@nvidia.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> * [Bugfix] Added rm -f / wget- nc command to avoid bash error in multispeaker sim notebook (#5292) * Added rm -f command to avoid error message Signed-off-by: Taejin Park <tango4j@gmail.com> * removed unnecessary changes Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * [DOC] added ipython dependency to support IPython.sphinxext extension (#5345) * [DOC] added ipython dependency to support IPython.sphinxext extension Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * revert ipython extension in the doc and replace ipython block with shell-session. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * Fix dialogue tutorial bug (#5297) (#5303) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Eric Harper <complex451@gmail.com> * Fix issue with HF Model upload tutorial (#5359) (#5360) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a387e3b3d9cdf511f7b9bbb5e94925e48cc2) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> * Bug fix (removing old compute consumed samples) (#5355) Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * removed uninstall nemo_cv and nemo_simple_gan and relax numba version… (#5332) * Update reinstall.sh and requirements. * removed nemo_cv and nemo_simple_gan in reinstall.sh. * relaxed numba version limits. * added tensorboard requirement to avoid any incpmpatible issue. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * revert changes for numba Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * Pipeline paralleism in Bert (#5293) * Global batch size support for validation * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Global batch size support for bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bert batch support * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bert batch size support * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * O2 support for bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_pretraining.py Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_config.yaml Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * Bug fix * Bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Bug fix * Bug fix * Bug fix * Update megatron_bert_config.yaml Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * PPBert * PPBert * PPBert * PPBert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_config.yaml Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> * bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix * bug fix * bug fix * bug fix Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * tutorial fixes (#5354) (#5361) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> * Enable mlflow logger (#4893) * Enable mlflow logger Signed-off-by: whrichd <trabeitwrq@gmail.com> * fix style Signed-off-by: whrichd <trabeitwrq@gmail.com> * Add doc lines. Signed-off-by: whrichd <trabeitwrq@gmail.com> * change default value Signed-off-by: whrichd <trabeitwrq@gmail.com> * fix doc Signed-off-by: whrichd <trabeitwrq@gmail.com> * addressed comments, added dataclass Signed-off-by: whrichd <trabeitwrq@gmail.com> * fix style Signed-off-by: whrichd <trabeitwrq@gmail.com> * fix doc Signed-off-by: whrichd <trabeitwrq@gmail.com> Signed-off-by: whrichd <trabeitwrq@gmail.com> * Add SDP documentation (#5274) (#5376) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) (#5381) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) (#5384) Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> * [TTS] Add Spanish model documentation (#5390) Signed-off-by: Ryan <rlangman@nvidia.com> * [TTS] Add Spanish FastPitch training configs (#5383) * [TTS] Add Spanish FastPitch training configs * [TTS] Add single speaker Spanish configs Signed-off-by: Ryan <rlangman@nvidia.com> * Fix Python type hints according to Python Docs (#5370) * Remove duplicated type annotations Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix tuple annotations in function return types Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Add necessary imports Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Add necessary imports Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix types in obvious places Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix types in obvious places Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix unused import (avoid quotes in type annotations) Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Revert "Fix unused import (avoid quotes in type annotations)" This reverts commit ea433efcd9916abf8944879e791484a0a1437f83. Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Remove problematic import Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix list_available_models method type Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Revert some changes Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Revert quotes in list_available_models Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Force MHA QKV onto fp32 (#5391) (#5395) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> * Add cpWER for evaluation of ASR with diarization (#5279) * Add cpWER calculation feature Signed-off-by: Taejin Park <tango4j@gmail.com> * added notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * updated notebook and diarization_utils Signed-off-by: Taejin Park <tango4j@gmail.com> * Minor update on tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update on missing docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed an unfinished docstring Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unused variables Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed dict input to list input Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed LGTM issues Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed error in cpWER cal Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * Fix some of the typing issues, lower case names Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Replaced bruteforce with LSA alg for cpWER Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected PR comments Signed-off-by: Taejin Park <tango4j@gmail.com> * Cleaned notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM warnings Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added test_diar_metrics.py Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed typos Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed wrong type annotations Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added bruteforce mode and its unit-test Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * LGTM issues fixed Signed-off-by: Taejin Park <tango4j@gmail.com> * reolve LGTM issues Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * unified speaker key in trans_dict Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Removed unused variable and imports Signed-off-by: Taejin Park <tango4j@gmail.com> * Update nemo/collections/asr/parts/utils/diarization_utils.py Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Update nemo/collections/asr/parts/utils/diarization_utils.py Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * moved all the diarization eval to der.py Signed-off-by: Taejin Park <tango4j@gmail.com> * Update tests/collections/asr/test_diar_metrics.py Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * der.py update on tests Signed-off-by: Taejin Park <tango4j@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * unused imports and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * unused import Signed-off-by: Taejin Park <tango4j@gmail.com> * reflected review comments Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed an import bug in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: SeanNaren <snarenthiran@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * Added cast Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Ante Jukić <ajukic@nvidia.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: arendu <adithya.r@gmail.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com> Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu> Signed-off-by: Dima Rekesh <drekesh@nvidia.com> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Micha Livne <mlivne@nvidia.com> Signed-off-by: Ryan <rlangman@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: whrichd <trabeitwrq@gmail.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: David <amosalla@asu.edu> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: anteju <108555623+anteju@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Shantanu Acharya <shantanua@nvidia.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Subhankar Ghosh <subhankar2321@gmail.com> Co-authored-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu> Co-authored-by: Dima Rekesh <bmwshop@gmail.com> Co-authored-by: Dima Rekesh <drekesh@nvidia.com> Co-authored-by: Yuekai Zhang <zhangyuekai@foxmail.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Co-authored-by: Micha Livne <mlivne@nvidia.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Ryan Langman <rlangman@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: anmolgupt <14880251+anmolgupt@users.noreply.github.com> Co-authored-by: Anmol Gupta <anmolg@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Riqiang Wang <43883260+whrichd@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: David <amosalla@asu.edu>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Co-authored-by: Jocelyn <jocelynh@nvidia.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (NVIDIA#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA#5277) * Typo fix (NVIDIA#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA#5359) * Add Gradio App to ASR Docs (NVIDIA#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA#5379) * Fixed bug in notebook (NVIDIA#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA#5242) (NVIDIA#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (NVIDIA#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (NVIDIA#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA#5277) * Typo fix (NVIDIA#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA#5359) * Add Gradio App to ASR Docs (NVIDIA#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA#5379) * Fixed bug in notebook (NVIDIA#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA#5242) (NVIDIA#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (NVIDIA#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (NVIDIA#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA#5277) * Typo fix (NVIDIA#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA#5359) * Add Gradio App to ASR Docs (NVIDIA#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA#5379) * Fixed bug in notebook (NVIDIA#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA#5242) (NVIDIA#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (NVIDIA#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Jocelyn Huang jocelynh@nvidia.com
What does this PR do ?
Fixes some minor typos in the TTS intro tutorial. No code changes.
Collection: TTS