Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Stateless decoder for RNN-T (NVIDIA#4710)
* stateless RNNT working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * batch decode working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * working backup Signed-off-by: Hainan Xu <hainanx@nvidia.com> * good working version Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temporarily make norm layer have affine Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] add staticmethod decoration for BetaBinomialInterpolator (#4319) Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] remove redundant lines and declare global variables and capture (#4320) exception of non-supported windows. Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Finetune T5 on the prefix-lm objective (#4328) * Add script and yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update yaml to remove hardcoded model path Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse bias with geglu in ParallelMLP (#4213) * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * push reformatted files * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for T5 CI test * recover latest change Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Support larger datasets for question answering (#4205) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fix bugs in indexed dataset exam script (#4325) * fix the typo Signed-off-by: Yi Dong <yidong@nvidia.com> * add neighbors option Signed-off-by: Yi Dong <yidong@nvidia.com> * change the argument name Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Adding docs for ASR SSL (#4303) * Initial commit for SSL docs Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-1 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-2 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> Co-authored-by: Krishna Puvvada <kpuvvada@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse grad division into async grad allreduce (#4327) * O2 runs but O1 does not Signed-off-by: ericharper <complex451@gmail.com> * disable async for O1 Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * update async flag in configure_optimizers Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * update _require if using async Signed-off-by: ericharper <complex451@gmail.com> * clean comments Signed-off-by: ericharper <complex451@gmail.com> * always all_reduce Signed-off-by: ericharper <complex451@gmail.com> * add async grad allreduce and chunk optimization to T5 * push reformatted files after style check * set chunk size as 0 while async grad allreduce is off * more experiments show that 125MB is a better default chunk size for most cases * add grad_allreduce_chunk_size_mb for GPT-3 * at the end of each training step, wait until all async grad allreduce works are done * replace individual allreduce work.wait() with a single dGPU evice synchroonization * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * record the status of each allreduce work seems too much for perf * add more comments * push a reformatted file * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * fuse grad scale with allreduce * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * add grad_scale_ar_fusion into GPT-3 * push reformatted files * push reformatted files * rename grad_scale_ar_fusion to grad_div_ar_fusion * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for T5 CI test * recover latest change * add grad_div_ar_fusion in config yaml file * remove a redundant float() Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Torchaudio installation fix (#4330) * separate installer added Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> * apply suggestions, minor fixes Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] enforced pin_memory = True (#4341) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] Training Fastpitch on German text and phonemes and finetuning HiFi-GAN on predicted mels (#4266) * initial commit Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * cleared notebook outputs Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting errors Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments on tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated grammar and fastpitch description Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * removed unused imports Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Speedup the speech commands dataset processing script (#4347) * Add multiprocessing support to the google speech commands dataset processing script Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix number of args error with __extract_all_files function Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix styling issues Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix bugs with silence set construction and update librosa output write to use soundfile write Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * add docstrings and return values in __construct_filepaths as dictionary Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix wrong requirement (#4349) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Refactored path to manifest (#4251) Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * IPA support for TTS (#4310) * IPA tokenizer and G2P untested draft Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add IPA CMUdict and new heteronyms list Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add draft FastPitch IPA config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Minor bugfixes for IPA training Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add phoneme_probability to IPA G2P Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Updates to IPA FastPitch training config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update IPA dict and heteronyms file Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Adjust default lr for IPA FastPitch to 1e-3 Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Rename IPA CMUdict to reflect date Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add docstrings for IPA tokenizer and G2P, update CMUdict path for config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Fix IPA vocab ordering, add options to uppercase graphemes and remove stress symbols Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Mark IPA classes as experimental Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update apostrophe-S cases Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Tn install (#4055) * remove conda pynini requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * remove remnants Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * merge with main Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * removing nlp collection dependency from text processing and thus breaking cyclyc imports Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix wrong requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix bug in vi Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * update jenkins folders Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix tutorial (#4352) Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix the post ln (#4350) Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Fix] Hanging for Fully Randomized Bucketing (#4348) * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Bits of RADTTS support (#4343) * Bits of RADTTS support Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Fixed args mismatch Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Style Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Addressed review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * More review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Prompt Learning Pipeline Parallel (#4291) * Added get_forward_output_and_loss_func and updated train/val steps Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added preprocess flag before prompt table/encoder access Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made two optimizer groups, one for frozen, one for soft prompt Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still figuring out setting lr/sched for one param group Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set betas to zero Signed-off-by: Virginia Adams <vadams@nvidia.com> * Only unfreeze one sublayer with lr 0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working w/ one optimizer Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * added prompt learning tp and pp CI tests Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added amp_o2 model compatibility Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made CI test smaller Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Temporarily moving prompt learning CI test to beginning Signed-off-by: Virginia Adams <vadams@nvidia.com> * Changing the layer being unfrozen Signed-off-by: Virginia Adams <vadams@nvidia.com> * debug jenkins Signed-off-by: Virginia Adams <vadams@nvidia.com> * Move pp unfreeze to init Signed-off-by: Virginia Adams <vadams@nvidia.com> * Try to make Jenkins test parallel Signed-off-by: Virginia Adams <vadams@nvidia.com> * Fix python formatting Signed-off-by: Virginia Adams <vadams@nvidia.com> * Moved prompt learning tests back to where they belong Signed-off-by: Virginia Adams <vadams@nvidia.com> * add back checkpoint convertion CI test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Revert "add back checkpoint convertion CI test" This reverts commit 61e2ffcdefe964c8e74b74d8c10906ae29f32b6d. * Add back checkpoint conversion test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Setting requires grad to True everywhere Signed-off-by: Virginia Adams <vadams@nvidia.com> * Updated config comments and simplified param group code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added comment on frozen_model having lr=0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added configure optimizers methods Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set amp_o2 to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * removed o2 code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python formatting fix Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * auto switch conformer encoder adapter in_features (#4354) Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Dataloader, collector, loss and metric for multiscale diarization decoder (#4187) * First commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Checked funtionality and imports Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed import issues Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed the changed made by mistake Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM errors 001 Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM again Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary torch setting lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix and isort Signed-off-by: Taejin Park <tango4j@gmail.com> * jbalam-nv comments reflected Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected comments and created _diar_label.py Signed-off-by: Taejin Park <tango4j@gmail.com> * Typo fix and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed target_spks[0] index error Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM unused import IterDataset Signed-off-by: Taejin Park <tango4j@gmail.com> * revert collection doc year Signed-off-by: Taejin Park <tango4j@gmail.com> * Code format error in collections.py Signed-off-by: Taejin Park <tango4j@gmail.com> * fix collections space format error Signed-off-by: Taejin Park <tango4j@gmail.com> * merged main correctly Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected all comments and tested Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix and LGTM Signed-off-by: Taejin Park <tango4j@gmail.com> * rttm_filepath to rttm_file and removed self included funcs, tested Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add ASR CTC Decoding module (#4342) * Initial commit Signed-off-by: smajumdar <smajumdar@nvidia.com> * Full support for decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Temp Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix labels of y_sequence Signed-off-by: smajumdar <smajumdar@nvidia.com> * Set support for sentencepiece subword merging Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix char and word based token merge alignment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Revert incorrect change Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docstring Signed-off-by: smajumdar <smajumdar@nvidia.com> * Improve compatibility with greedy tokens and log probs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update scripts to use decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix decoding of ctc models Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Option to disable mp in VAD via num_workers=1 (#4317) * Option to disable mp in VAD via num_workers=1 In certain environments python multiprocessing can deadlock. This adds a convenient version to disable by setting num_workers to 1. Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * add none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * additional none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * remove redundant bias expand (#4382) * remove redundant bias expand Signed-off-by: Xiaowei Ren <xren@nvidia.com> * delete redundant code Signed-off-by: Xiaowei Ren <xren@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add option for specifying wandb save_dir from config (#4379) * give option to user to specify wandb save dir via config Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * create save_dir directory for wandb logger if not exists Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * update save_dir get method with a default value Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. (#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. (#4388) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.10.0 main (#4398) * update branch Signed-off-by: ericharper <complex451@gmail.com> * Set headscale false (#4364) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add wandb as dependency (#4365) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Raise trainer error (#4356) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Set headscale false (#4364) (#4366) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: smajumdar <smajumdar@nvidia.com> * Finetuning changes for BART (#4003) * Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Checkpoint converter to nemo for bart Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (#4357) * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix logging warning Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Fix electronic bug, new time ITN rule (#4355) * fix electronic bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * add new itn time rule Signed-off-by: ekmb <ebakhturina@nvidia.com> * revert domain changes Signed-off-by: ekmb <ebakhturina@nvidia.com> * remove repetition Signed-off-by: ekmb <ebakhturina@nvidia.com> * Correct support for dataclasses in default module dim (#4372) * Correct support for dataclasses in default module dim Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix path for save of results Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix pad id bug (#4377) Signed-off-by: Yi Dong <yidong@nvidia.com> * Question answering bug fix (#4381) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix …
- Loading branch information