forked from NVIDIA/NeMo
-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
main to ssl synthesis #9
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
* 1. Debugging. Signed-off-by: Micha Livne <mlivne@cs.toronto.edu> * 1. BART dataset fixes missing <EOS> for deocder output. Signed-off-by: Micha Livne <mlivne@cs.toronto.edu> * 1. Debugging. Signed-off-by: Micha Livne <mlivne@cs.toronto.edu> * 1. Debugging. Signed-off-by: Micha Livne <mlivne@cs.toronto.edu> * 1. Removed extra padding from BARTDataset. Signed-off-by: Micha Livne <mlivne@cs.toronto.edu>
* Updated pipeline parallel code to speed up training Signed-off-by: Virginia Adams <vadams@nvidia.com> * Load global batch size not local mini batch size Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python reformatting Signed-off-by: Virginia Adams <vadams@nvidia.com>
* Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add megatron dataset Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update config and fix global batch fetcher Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add dataset class Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update yaml Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix duplicate yaml key Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Translate method and preprocess script for raw text Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove pdb Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix arg name Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix other arg Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change sampler back Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Move back to global batch fetcher to use distributed sampler Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add text memmap data Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update monitor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for PP Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove unused import Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Truncate examples in text memmap Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * NMT training batch interpolation key Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * tarred data fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change dataset type check Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix sampler Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Pass dataset cfg to determine type Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Log global step on validation step as well Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix NMT model saving with artifacts Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Initialize DDP in decode if not initialized. Needed for inference only mode Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Megatron NMT inference script Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Inference config file Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * hardcode max delta temporarily Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * detokenizer if processor is not none Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Sampler config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Compat with configs without sampler arg Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment for validation dataset type Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix tokenizer building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * CI test for megatron nmt Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix tokenizer in restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * O2 restore from fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove print Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change tokenizer model name in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Logging Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Set seed for distributed sampler Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Cluster debugging messages Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix max generation delta Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * No LM Init Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Use nlp save restore connector Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove useless infer args Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * UTF8 safe print of translation result Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add save restore connector back with comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix CI test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add missing args Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Address comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Empty to restart * Fix CI test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Check for test ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * set fusion to false Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Initial perceiver encoder Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Perceiver with PP=1 Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove init cross attn Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * CI test and remove init cross attn arg Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove init cross attn layers from file Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clean up Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update branch Signed-off-by: ericharper <complex451@gmail.com> * Set headscale false (NVIDIA#4364) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add wandb as dependency (NVIDIA#4365) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Raise trainer error (NVIDIA#4356) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Set headscale false (NVIDIA#4364) (NVIDIA#4366) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: smajumdar <smajumdar@nvidia.com> * Finetuning changes for BART (NVIDIA#4003) * Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Checkpoint converter to nemo for bart Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (NVIDIA#4357) * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix logging warning Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Refactor bias act fusion Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update NMT config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix electronic bug, new time ITN rule (NVIDIA#4355) * fix electronic bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * add new itn time rule Signed-off-by: ekmb <ebakhturina@nvidia.com> * revert domain changes Signed-off-by: ekmb <ebakhturina@nvidia.com> * remove repetition Signed-off-by: ekmb <ebakhturina@nvidia.com> * Update ci tests Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Correct support for dataclasses in default module dim (NVIDIA#4372) * Correct support for dataclasses in default module dim Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix path for save of results Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix pad id bug (NVIDIA#4377) Signed-off-by: Yi Dong <yidong@nvidia.com> * Question answering bug fix (NVIDIA#4381) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * set wandb logger falseby default Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix * style fix * correct typo * style fix * style fix Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Fix ASR Typos in tutorials (NVIDIA#4384) * Fix typos Signed-off-by: smajumdar <smajumdar@nvidia.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. (NVIDIA#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> (cherry picked from commit 0322b15) Co-authored-by: Travis Bartley <Travismbartley@gmail.com> * Add Docs for NeMo Adapters (NVIDIA#4369) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update NeMo docs (NVIDIA#4397) Signed-off-by: smajumdar <smajumdar@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> * Punctuation and capitalization tests race condition (NVIDIA#4399) * Add draft of race condition fixes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Minor improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * More race condition fixes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Improve error message Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Improve error message Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Improve error message Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bias act fusion changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Address comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix geglu without fusion Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Reset files to main Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove hidden blocks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Co-authored-by: Abhinav Khattar <aklife97@gmail.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Travis Bartley <Travismbartley@gmail.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru>
* initial commit Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Default maj_vote = False, max_rp=0.25 Signed-off-by: Taejin Park <tango4j@gmail.com> * doc strings and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Docstring minor edit Signed-off-by: Taejin Park <tango4j@gmail.com> * Default False in the functions Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed repeated variable Signed-off-by: Taejin Park <tango4j@gmail.com> * Default as maj_vote=False Signed-off-by: Taejin Park <tango4j@gmail.com> * removed redundant part in wrtie_rttm func Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unused function Signed-off-by: Taejin Park <tango4j@gmail.com> * Updated and tested silence and very short samples Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix and removing unnecessary parts Signed-off-by: Taejin Park <tango4j@gmail.com> * unused variables are removed Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed commented torch.jit.script Signed-off-by: Taejin Park <tango4j@gmail.com> * majority voting update Signed-off-by: Taejin Park <tango4j@gmail.com> * cancelling the update on speaker_utils and clus_diarizer Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * bug fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Added fp32 converting for torch.mm Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Signed-off-by: saarus72 <saarus72@gmail.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Adrian Lancucki <alancucki@users.noreply.github.com> Co-authored-by: Adrian Lancucki <alancucki@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Dataloader, collector, loss and metric for multiscale diarization decoder (NVIDIA#4187) * First commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Checked funtionality and imports Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed import issues Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed the changed made by mistake Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM errors 001 Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM again Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary torch setting lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix and isort Signed-off-by: Taejin Park <tango4j@gmail.com> * jbalam-nv comments reflected Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected comments and created _diar_label.py Signed-off-by: Taejin Park <tango4j@gmail.com> * Typo fix and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed target_spks[0] index error Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM unused import IterDataset Signed-off-by: Taejin Park <tango4j@gmail.com> * revert collection doc year Signed-off-by: Taejin Park <tango4j@gmail.com> * Code format error in collections.py Signed-off-by: Taejin Park <tango4j@gmail.com> * fix collections space format error Signed-off-by: Taejin Park <tango4j@gmail.com> * merged main correctly Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected all comments and tested Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix and LGTM Signed-off-by: Taejin Park <tango4j@gmail.com> * rttm_filepath to rttm_file and removed self included funcs, tested Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * removed references to data_dir Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * added missing parameters to data preparation script Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * removed unnecessary file extension check Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add ASR CTC Decoding module (NVIDIA#4342) * Initial commit Signed-off-by: smajumdar <smajumdar@nvidia.com> * Full support for decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Temp Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix labels of y_sequence Signed-off-by: smajumdar <smajumdar@nvidia.com> * Set support for sentencepiece subword merging Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix char and word based token merge alignment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Revert incorrect change Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docstring Signed-off-by: smajumdar <smajumdar@nvidia.com> * Improve compatibility with greedy tokens and log probs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update scripts to use decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix decoding of ctc models Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Option to disable mp in VAD via num_workers=1 (NVIDIA#4317) * Option to disable mp in VAD via num_workers=1 In certain environments python multiprocessing can deadlock. This adds a convenient version to disable by setting num_workers to 1. Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * add none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * additional none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * remove redundant bias expand (NVIDIA#4382) * remove redundant bias expand Signed-off-by: Xiaowei Ren <xren@nvidia.com> * delete redundant code Signed-off-by: Xiaowei Ren <xren@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixed style Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add option for specifying wandb save_dir from config (NVIDIA#4379) * give option to user to specify wandb save dir via config Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * create save_dir directory for wandb logger if not exists Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * update save_dir get method with a default value Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. (NVIDIA#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. (NVIDIA#4388) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Merge r1.10.0 main (NVIDIA#4398) * update branch Signed-off-by: ericharper <complex451@gmail.com> * Set headscale false (NVIDIA#4364) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add wandb as dependency (NVIDIA#4365) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Raise trainer error (NVIDIA#4356) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Set headscale false (NVIDIA#4364) (NVIDIA#4366) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: smajumdar <smajumdar@nvidia.com> * Finetuning changes for BART (NVIDIA#4003) * Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Checkpoint converter to nemo for bart Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (NVIDIA#4357) * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix logging warning Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Fix electronic bug, new time ITN rule (NVIDIA#4355) * fix electronic bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * add new itn time rule Signed-off-by: ekmb <ebakhturina@nvidia.com> * revert domain changes Signed-off-by: ekmb <ebakhturina@nvidia.com> * remove repetition Signed-off-by: ekmb <ebakhturina@nvidia.com> * Correct support for dataclasses in default module dim (NVIDIA#4372) * Correct support for dataclasses in default module dim Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix path for save of results Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix pad id bug (NVIDIA#4377) Signed-off-by: Yi Dong <yidong@nvidia.com> * Question answering bug fix (NVIDIA#4381) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * set wandb logger falseby default Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix * style fix * correct typo * style fix * style fix Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Fix ASR Typos in tutorials (NVIDIA#4384) * Fix typos Signed-off-by: smajumdar <smajumdar@nvidia.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. (NVIDIA#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> (cherry picked from commit 0322b15) Co-authored-by: Travis Bartley <Travismbartley@gmail.com> * Add Docs for NeMo Adapters (NVIDIA#4369) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update NeMo docs (NVIDIA#4397) Signed-off-by: smajumdar <smajumdar@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * remove Copy of Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Travis Bartley <Travismbartley@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * [bugfix][TTS] pitch, voiced_mask, prob_voiced have the same values. (NVIDIA#4392) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixing import error in some cases (NVIDIA#4401) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixing bugs in calling method ctc_decoder_predictions_tensor. (NVIDIA#4414) * updated ctc decoding calls. Signed-off-by: Vahid <vnoroozi@nvidia.com> * fixed the ones for timestamp_utils.py Signed-off-by: Vahid <vnoroozi@nvidia.com> * fixed the ones for timestamp_utils.py Signed-off-by: Vahid <vnoroozi@nvidia.com> * fixed the ones for timestamp_utils.py Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Update with new conformer checkpoints. (NVIDIA#4417) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * [TTS] add static method decorator. (NVIDIA#4443) * [TTS] add static method decorator. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * remove protect prefix Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * fixed style error Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Georg Kucsko <gkucsko@users.noreply.github.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Xiaowei Ren <103958965+xrennvidia@users.noreply.github.com> Co-authored-by: Shantanu Acharya <shantanua@nvidia.com> Co-authored-by: Travis Bartley <Travismbartley@gmail.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
…oading mel Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
* Add ITN pt Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br> * Fix style Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br> * Fix style Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br> * Update copyright year to 2022 on ITN pt rules and tests Signed-off-by: Guilherme Steinmann <guist@linse.ufsc.br>
Signed-off-by: Ante Jukić <ajukic@nvidia.com> Co-authored-by: Ante Jukić <ajukic@nvidia.com>
phoneme IY1 -> IH1 in NVIDIA Added phonemes for CUSTOMIZABLE Update cmudict file revision and its reference. Signed-off-by: Jason Roche <jroche@nvidia.com> Co-authored-by: Jason Roche <jroche@nvidia.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* add support for param groups Signed-off-by: stevehuang52 <heh@nvidia.com> * make config more general Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com>
* initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * fixed silence wav file issue causing clustering to evaluate on null embeddings Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * fixed zero duration issue Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated with comments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * minor doc change Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * update log Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* Runtime check refinements Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Added fp32 casting for ASR nets export Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * style Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Used torch.float32 for clarity Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Fixing parameters passing Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* initial_script Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * move old script Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * remove finetune func from label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * style clean Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated config Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * update tutorial Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * lgtm fixes Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated based on comments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * update doc Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
…DIA#4511) * Preserve logprobs when preserving alignments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update tests for rnnt gredy and beam search Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update all dependents of alignments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docs Signed-off-by: smajumdar <smajumdar@nvidia.com>
…_whitespace (NVIDIA#4525) * Fix geglu without fusion Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add extra args Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Reset transformer Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix spm arg Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix help string Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add support for access mixin registry of custom losses Signed-off-by: smajumdar <smajumdar@nvidia.com> * add support for asr custom losses Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update for l2 loss Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add unittests Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add unittests Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add unittests Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update registration of tensors to reset after finishing step Signed-off-by: smajumdar <smajumdar@nvidia.com> * Remove comment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Remove comment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update SSL models Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add support for validation step properly registering tensors Signed-off-by: smajumdar <smajumdar@nvidia.com> * Move reset of registry outside Signed-off-by: smajumdar <smajumdar@nvidia.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
* fix duplex inference with grammars Signed-off-by: ekmb <ebakhturina@nvidia.com> * add ci test for duplex, fix electronic last sym bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * test fix Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * update jenkins grammars Signed-off-by: ekmb <ebakhturina@nvidia.com> * add pt to the docs Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * disable test Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * jenkins refactor Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * jenkins Signed-off-by: ekmb <ebakhturina@nvidia.com> * test Signed-off-by: ekmb <ebakhturina@nvidia.com> * test Signed-off-by: ekmb <ebakhturina@nvidia.com> * test Signed-off-by: ekmb <ebakhturina@nvidia.com> * test Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
…DIA#4465) * Add Bucketing support to TarredAudioToClassificationLabelDataset Signed-off-by: Ewald Enzinger <ewald.enzinger@entn.at>
* add FT support Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * rm preproc Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * review changes Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * add CI Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * newline fix Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * CI fix Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * clean up Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * post training cleanup Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * test Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * revert Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * CI test Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * revert CI changes Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * original CI Signed-off-by: Abhinav Khattar <aklife97@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
* add nsys profiling Signed-off-by: ericharper <complex451@gmail.com> * only access omegaconf in setup Signed-off-by: ericharper <complex451@gmail.com> * use robust get_rank function Signed-off-by: ericharper <complex451@gmail.com> * simplify Signed-off-by: ericharper <complex451@gmail.com>
* refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR NVIDIA#3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * set wandb logger falseby default Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update interface with megatron gpt prompt learning Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update inline documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update prompt_ids Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update config Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update config Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * set inference = False for dialgue prompt learning during trainng Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * set inference = False for dialgue prompt learning during trainng Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update config yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bug for megatron gpt prompt learning Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address comments in PR Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address comments in PR Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* update fastpitch to add export controls Signed-off-by: Jason <jasoli@nvidia.com> * final touchups Signed-off-by: Jason <jasoli@nvidia.com> * more final touchups Signed-off-by: Jason <jasoli@nvidia.com>
… models (NVIDIA#4550) Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* added MLM Scoring Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix header Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * refactor Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix bug that made normalization options set Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix style Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix discrepancy of space versus no space to previous version e.g. < sixteen > and <sixteen> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * remove and from cardinal when lm is used to reduce number of options Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix grammar Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix masked input for [MASK] token before mlm scoring Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * mask out everything apart from one semiotic token Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * reverted masking change and added roman to lm Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix slash, expand measure Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix masked scoring Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * audio based set fix for --lm Signed-off-by: ekmb <ebakhturina@nvidia.com> * fix bug Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * added jenkins test Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * update jenkins Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix header Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix lgtm Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * add dependency Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * moved mlmscore file Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * moved hybrid to nemo_text_processing folder Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * update jenkins Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix path Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix test Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix dataset license Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com>
* Removed NLPDDPPlugin Import check Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python formatting fix Signed-off-by: Virginia Adams <vadams@nvidia.com> * changed app to app_state Signed-off-by: Virginia Adams <vadams@nvidia.com> * moved num workers check back to bottom Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python code reformat Signed-off-by: Virginia Adams <vadams@nvidia.com>
* Add length ratio filtering script Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix example Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove extra quotes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Abhinav Khattar <aklife97@gmail.com>
…DIA#4557) * add script Signed-off-by: Abhinav Khattar <aklife97@gmail.com> * style fix Signed-off-by: Abhinav Khattar <aklife97@gmail.com>
…ng rate Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
… length Signed-off-by: Paarth Neekhara <paarth.n@gmail.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com> Co-authored-by: Ante Jukić <ajukic@nvidia.com>
* add ngc catalan model resource Signed-off-by: stevehuang52 <heh@nvidia.com> * update docs Signed-off-by: stevehuang52 <heh@nvidia.com>
* fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change max sequence length computation Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add args and refactor to support bart, ul2 Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add to CI test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Attempt fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Revert max seq length change Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix syntax error Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Revert BART change Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove unused import Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added class for qa related metrics Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed BLEU code from QA metrics Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added classes for data handling and loading for BERT/T5/BART/GPT Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed unnecassary main function Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added classes for BERT, S2S(T5/BART), GPT question answering models Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * created separate modules for model specific input features Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * moved non-moodel methods to QAMetrics and refactored method names to more intuitive Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * changes classmethods to staticmethods Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed unnecassary copyright Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed deprecated input features file Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * abstracted cache filename, feature loading, feature dumping to QADataset Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed unused imports and added dataclass decorator Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed unused imports and refactored method name Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added base class for QA models and abstracted out common methods Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * moved non-model eval code and predictions file dump to metrics class Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added combined example of train/eval/test/inference for all qa models Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * renamed qa example file Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * fixed trailing whitespaces Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added type casting to float for logger warning Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed unsed import Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * converted cached filename creation to class method Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * moved common code in dataset classes to base class, renamed Features class to Example Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * converted base QA example class to dataclass Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * reduced code repition in prediciton evaluation Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * converted prediction output files to jsonl Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added flag for checking if ground truth present in context spans Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * converted predictions dump to jsonl from json Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * converted nbest predictions dump to jsonl from json Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * removed unused argument to no pad loss method Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added unit tests for qa metrics and dataset utilities Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * applied style fix on new files Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added integration tests Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * restored default values in qa config Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * renamed stage to avoid duplicate Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added init files for new modules Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * applied style fix for module init files Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added inline comments to make concise Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * specified class as abstract Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * specified .json format for output prediction files Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * created separate variable for answer in context check for readability Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * shifted stages to parallel Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * applied style fix Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * restored file modified by linter Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added transformers offline flag to true and moved all stages to parallel Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * moved inference code inside test_ds check Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added script for converting msmarco dataset to squad format Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added tutorial for question answering with generative models Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added copyright header Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * renamed old qa docs with _squad postfix and added docs for new qa modules Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * added generative qa architecture diagram Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * modified tutorial with colab testing changes, improved documentation Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> * changed branch name to main in tutorial * deprecated old QA tutorial * deprecated old QA docs * deprecated old QA example * removed deprecated ci test for old qa example * removed additional deprecated ci tests
* add kw asr models, add itn ru checkpoint (tagger-based) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add rw results to docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com>
Signed-off-by: Anas Abou Allaban <aabouallaban@pm.me> Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Signed-off-by: Ameya Mahabaleshwarkar <ameyasm1154@gmail.com> Co-authored-by: Eric Harper <complex451@gmail.com>
…VIDIA#4553) * fix tarred dataset len when num shards is not divisible by workers Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si> * update error reporting on invalid `shard_strategy` * update NLP/PC tarred dataset docstring * add `shard_strategy` to NLP/PC `@dataclass` * update NLP/PC tarred dataset docstring * add `shard_strategy` to NLP/PC docs * revert test with Dataloader retruning the actual data length * make dataloader return actual num of samples, set `limit_train_baches` on `setup_*` * update `shard_strategy` docstrings Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si> * update `tarred_dataset` documentation Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si> * fix style * update documentation Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si> * updated docstrings Signed-off-by: Iztok Lebar Bajec <ilb@fri.uni-lj.si> Co-authored-by: PeganovAnton <peganoff2@mail.ru>
…nce (NVIDIA#4582) [TTS][ASR] enabled overriding arguments for trimming the leading and trailing silence using librosa.effects.trim Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
…VIDIA#4607) * auto switch conformer encoder adapter in_features Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * update the norm and warmup default values in the adapters config file Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
…ckpointing (rebased) (NVIDIA#4380) * update gpt config and add docstring to parallel_lm_logits Signed-off-by: ericharper <complex451@gmail.com> * update parallel_lm_logits Signed-off-by: ericharper <complex451@gmail.com> * add CoreAttention and start updating ParallelAttention Signed-off-by: ericharper <complex451@gmail.com> * in progress Signed-off-by: ericharper <complex451@gmail.com> * fix args Signed-off-by: ericharper <complex451@gmail.com> * update ParallelTransformerLayer_ Signed-off-by: ericharper <complex451@gmail.com> * update ParallelTransformer Signed-off-by: ericharper <complex451@gmail.com> * remove test_from_pretrained Signed-off-by: ericharper <complex451@gmail.com> * update args Signed-off-by: ericharper <complex451@gmail.com> * propogate args Signed-off-by: ericharper <complex451@gmail.com> * add transposes to GPTModel Signed-off-by: ericharper <complex451@gmail.com> * update matmul_input_buffer dynamically Signed-off-by: ericharper <complex451@gmail.com> * add sequence_parallel arg to post_language_model_processing Signed-off-by: ericharper <complex451@gmail.com> * allreduce sequence parallel layernorm Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * flag is sequence_parallel_enabled Signed-off-by: ericharper <complex451@gmail.com> * add sequence parallel args Signed-off-by: ericharper <complex451@gmail.com> * add seq parallel arg to fwd/bwd func Signed-off-by: ericharper <complex451@gmail.com> * don't wrap model with ddp when using O2 Signed-off-by: ericharper <complex451@gmail.com> * raise error when using method or num_layers with selective Signed-off-by: ericharper <complex451@gmail.com> * add sequence parallel for MixedFusedLayerNorm Signed-off-by: ericharper <complex451@gmail.com> * make sure checkpointing is set correctly Signed-off-by: ericharper <complex451@gmail.com> * make sure checkpointing is set correctly Signed-off-by: ericharper <complex451@gmail.com> * make sure checkpointing is set correctly Signed-off-by: ericharper <complex451@gmail.com> * check the right attribute Signed-off-by: ericharper <complex451@gmail.com> * fix args Signed-off-by: ericharper <complex451@gmail.com> * style Signed-off-by: ericharper <complex451@gmail.com> * don't sync after fwd/bwd if using seq par Signed-off-by: ericharper <complex451@gmail.com> * use base model for allreduce_grads Signed-off-by: ericharper <complex451@gmail.com> * remove extra layernorm Signed-off-by: ericharper <complex451@gmail.com> * revert norm_former_norm deletion Signed-off-by: ericharper <complex451@gmail.com> * move sync for allreduce grad to optimizer wrapper Signed-off-by: ericharper <complex451@gmail.com> * auto configure grad div ar fusion Signed-off-by: ericharper <complex451@gmail.com> * Initial rpe refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor RPE Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * add transposes to t5 forward Signed-off-by: ericharper <complex451@gmail.com> * Gradient Accumulation fusion to Linear layer weight gradient computation (NVIDIA#4494) * Gradient Accumulation fusion to Linear layer weight gradient computation * fix typo * disable async when using gradient accumulation fusion Signed-off-by: ericharper <complex451@gmail.com> * add comment Signed-off-by: ericharper <complex451@gmail.com> * skip H2D copies of inputs (NVIDIA#4502) add assert * add rpe to core attention Signed-off-by: ericharper <complex451@gmail.com> * set sp to false if tp > 1. make cuda_device_max_connections configurable Signed-off-by: ericharper <complex451@gmail.com> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix key Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * convert to string Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * determine if no_async for ColumnLinear Signed-off-by: ericharper <complex451@gmail.com> * add async_grad_allreduce to parallel_lm_logits for gpt Signed-off-by: ericharper <complex451@gmail.com> * add async_grad_allreduce to parallel_lm_logits for bert and t5 Signed-off-by: ericharper <complex451@gmail.com> * disable gradient accumulation fusion when not using pipeline parallelism Signed-off-by: ericharper <complex451@gmail.com> * style Signed-off-by: ericharper <complex451@gmail.com> * Sequence parallel rebase with bugfixes (NVIDIA#4529) * Support for class labels as strings Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix for micro/macro average Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Metric fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix geglu without fusion Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * style Signed-off-by: ericharper <complex451@gmail.com> * disable grad accumulation fusion with O1 Signed-off-by: ericharper <complex451@gmail.com> * transpose prompt learning encoder input Signed-off-by: ericharper <complex451@gmail.com> * update container in jenkins Signed-off-by: ericharper <complex451@gmail.com> * add pleasefixme to retrieval tests Signed-off-by: ericharper <complex451@gmail.com> * comment retro test from jenkins Signed-off-by: ericharper <complex451@gmail.com> * Make RETRO SP compatible (NVIDIA#4565) * fix sp for retro Signed-off-by: Yi Dong <yidong@nvidia.com> * more tests fixed Signed-off-by: Yi Dong <yidong@nvidia.com> * make it sp compatible Signed-off-by: Yi Dong <yidong@nvidia.com> * add transpose for bert Signed-off-by: ericharper <complex451@gmail.com> * fix bug in pooler Signed-off-by: ericharper <complex451@gmail.com> * remove unused import Signed-off-by: ericharper <complex451@gmail.com> * remove unused import Signed-off-by: ericharper <complex451@gmail.com> * fix output Signed-off-by: ericharper <complex451@gmail.com> * revert normformer delete Signed-off-by: ericharper <complex451@gmail.com> * use 22.07, comment broken jenkins test Signed-off-by: ericharper <complex451@gmail.com> * remove unused import Signed-off-by: ericharper <complex451@gmail.com> * revert test comment Signed-off-by: ericharper <complex451@gmail.com> * add model.optim.capturable=True Signed-off-by: ericharper <complex451@gmail.com> * add model.optim.capturable=True Signed-off-by: ericharper <complex451@gmail.com> * set num_workers=0 Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Sangkug Lym <slym@nvidia.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com>
* Fix word boundaries Signed-off-by: smajumdar <smajumdar@nvidia.com> * Remove prints Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update notebook Signed-off-by: smajumdar <smajumdar@nvidia.com>
Signed-off-by: Anas Abou Allaban <aabouallaban@pm.me>
* Support listing Hugging Face model info Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add documentation about usage Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add documentation about usage Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update name of method, support list of model filters Signed-off-by: smajumdar <smajumdar@nvidia.com> * Improve docstring Signed-off-by: smajumdar <smajumdar@nvidia.com>
Signed-off-by: Ryan <rlangman@nvidia.com>
* Update audio_to_diar_label to train meeting data Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix with --scope=nemo Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix problem, re-run style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed remaining commented lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Remove an unused variable Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected comments Signed-off-by: Taejin Park <tango4j@gmail.com> * style fixed Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix for no reason Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
* Initial squeezeformer impl Signed-off-by: smajumdar <smajumdar@nvidia.com> * Start time reduce and recovery Signed-off-by: smajumdar <smajumdar@nvidia.com> * Working commit of time reduction and time recovery modules Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix issue with number of params being incorrect Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add initializations to the model Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix scheduler Signed-off-by: smajumdar <smajumdar@nvidia.com> * Remove float() Signed-off-by: smajumdar <smajumdar@nvidia.com> * Correct order of operations Signed-off-by: smajumdar <smajumdar@nvidia.com> * Correct order of operations Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update time reduce PE to only update PE and nothing else Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix initialization Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix PE usage Signed-off-by: smajumdar <smajumdar@nvidia.com> * Comment out k2 for now Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add usage comments to buffered ctc script Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add squeezeformer configs for CTC Signed-off-by: smajumdar <smajumdar@nvidia.com> * Mark squeezeformer as experimental Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add Jenkinsfile test Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add Jenkinsfile test Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix style Signed-off-by: smajumdar <smajumdar@nvidia.com> * Replace all with /content/ Signed-off-by: smajumdar <smajumdar@nvidia.com> * Try Jenkinsfile Fix with closure Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update ctc config Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update ctc config Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update ctc config Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add squeezeformer Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add squeezeformer Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix Jenkinsfile Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix Jenkinsfile Signed-off-by: smajumdar <smajumdar@nvidia.com> * Try closure Signed-off-by: smajumdar <smajumdar@nvidia.com> * Remove test Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add back squeezeformer test Signed-off-by: smajumdar <smajumdar@nvidia.com> * Remvoe script tag Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update for review comments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Remove experimental Signed-off-by: smajumdar <smajumdar@nvidia.com> * Correct an issue with RNNT alignments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Correct an issue with RNNT metrics Signed-off-by: smajumdar <smajumdar@nvidia.com> * Code formatting Signed-off-by: smajumdar <smajumdar@nvidia.com> * Correct offset calculation for no look ahead Signed-off-by: smajumdar <smajumdar@nvidia.com>
paarthneekhara
added a commit
that referenced
this pull request
Dec 5, 2023
* multitask dataset and phoneme tts Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * rbased with jason's branch Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * vocab size in sampling params issue Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * added 8 to prompt len to check Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * Bring back Attention mask Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * flash attention attention map config Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * attention prior false handling, and hardcoded attention mask to be used from dataset Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * loss mask was excluding the first label, so adjusted that Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * some comments for debugging eos id Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * inference related changes Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * custom inference wip Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * wip for phoneme tokenizer on tarred dataset Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * more batches in inference Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * handle attention prior not coming from the dataloader Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * some changes and minor bug fix Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> * option to use learned position embeddings Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> --------- Signed-off-by: Paarth Neekhara <paarth.n@gmail.com> Co-authored-by: Jason <jasoli@nvidia.com>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information