Releases: NVIDIA/NeMo
NVIDIA Neural Modules 1.11.0
Container
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.07
ASR
Changelog
- Add ASR CTC Decoding module by @titu1994 :: PR: #4342
- Fixing bugs in calling method ctc_decoder_predictions_tensor. by @VahidooX :: PR: #4414
- Fixed WER initialization in ASR_with_Nemo notebook by @anteju :: PR: #4523
- Update signature of Hypothesis alignments by @titu1994 :: PR: #4511
- Add support for ASR Adapter Auxiliary Losses by @titu1994 :: PR: #4480
- Catalan ASR NGC Resource by @stevehuang52 :: PR: #4576
- Add kw asr models, add itn ru checkpoint (tagger-based) by @bene-ges :: PR: #4595
- Add DALI char dataset support to SSL model by @piraka9011 :: PR: #4592
- Customize arguments for trimming the leading/trailing silence by @XuesongYang :: PR: #4582
- Update Offline ASR with CTC Decoding by @titu1994 :: PR: #4608
- Add Squeezeformer to ASR by @titu1994 :: PR: #4416
- Fix ASR notebooks by @titu1994 :: PR: #4738
- Add pretrained ASR models for Croatian by @anteju :: PR: #4682
- Dataloader, collector, loss and metric for multiscale diarization decoder by @tango4j :: PR: #4187
- Multilingual VAD model by @fayejf :: PR: #4734
- Adding support for models trained with full context for cache-aware streaming. by @VahidooX :: PR: #4687
- Fp16 support for Conformer by @bmwshop :: PR: #4571
- Tiny VAD refactoring for postprocessing by @fayejf :: PR: #4625
- Add silence handling for speaker diarization pipeline by @nithinraok :: PR: #4512
- Add Bucketing support to TarredAudioToClassificationLabelDataset by @entn-at :: PR: #4465
TTS
Changelog
- Wrong order of returned tuple for general_collate_fn. by @XuesongYang :: PR: #4388
- Pitch, voiced_mask, prob_voiced have the same values which is not expected. by @XuesongYang :: PR: #4392
- Add static method decorator. by @XuesongYang :: PR: #4443
- Fix typo in HiFi-GAN config's max steps by @XuesongYang :: PR: #4450
- Relaxed support for both CPUs and GPUs by @XuesongYang :: PR: #4461
- Multi-speaker fastpitch model training recipe on HUI-Audio-Corpus-German by @XuesongYang :: PR: #4413
- Created the finetuning Hifigan 44100Hz recipe on HUI-Audio-Corpus-German by @XuesongYang :: PR: #4478
- Fix dataset parameter typo on tacotron2 example yaml by @saarus72 :: PR: #4471
- Update cmudict by @jasro23 :: PR: #4510
- Customize arguments for trimming the leading/trailing silence by @XuesongYang :: PR: #4582
- Fix off-by-1 bug in Beta Binomial Prior by @rlangman :: PR: #4616
- G2P Aligner by @redoctopus :: PR: #4604
- RADTTS ADLR-NEMO porting by @MikyasDesta :: PR: #4538
- Fixed wrong pronunciations for r1.11. by @XuesongYang :: PR: #4677
- Incremented the version number to 22.08 in tutorials. by @XuesongYang :: PR: #4684
- Bugfix for missing configs. by @XuesongYang :: PR: #4725
- Fix pynini install in TTS tutorials by @redoctopus :: PR: #4729
- Updated config with a German IPA phoneme tokenizer by @XuesongYang :: PR: #4756
- Add multi-speaker German FastPitch and HiFiGAN NGC checkpoints by @XuesongYang :: PR: #4763
- Add single male speaker German FastPitch and HiFiGAN NGC checkpoints by @XuesongYang :: PR: #4770
- Deprecated old scripts for ljspeech. by @XuesongYang :: PR: #4780
- Fix MixerTTS data loading index error by @redoctopus :: PR: #4811
- G2P docs by @ekmb :: PR: #4841
- NMESC speaker counting algorithm update by @tango4j :: PR: #4500
NLP / NMT
Changelog
- Add O2 support for RETRO model by @yidong72 :: PR: #4411
- Add MTEncDec Finetune support by @aklife97 :: PR: #4540
- Fix metric setup for finetuning without a test set by @MaximumEntropy :: PR: #4585
- T0 model and dataset by @MaximumEntropy :: PR: #4598
- Add prompt learning for T5 by @HeyyyyyyG :: PR: #4391
- Add MuTransfer Capablity to RETRO model pretraining by @yidong72 :: PR: #4643
- Label Smoothing in VocabParallelCrossEntropy by @MaximumEntropy :: PR: #4602
- Megatron BART BOS / EOS bug fix by @michalivne :: PR: #4495
- GPT Prompt Learning Improvements by @vadam5 :: PR: #4496
- Megatron perceiver with tensor parallelism only by @MaximumEntropy :: PR: #4318
- Refactor for punctuation model by @jubick1337 :: PR: #4367
- Update megatron prompt learning interface to dialogue by @Zhilin123 :: PR: #4545
- Removed NLPDDPPlugin Import check by @vadam5 :: PR: #4555
- Option to disregard document boundaries for t5, bart, ul2 by @MaximumEntropy :: PR: #4481
- Add Tokenization and Normalization pre-proecssing script for NMT by @aklife97 :: PR: #4557
- Integrating support for GPT/T5/BART for Question Answering by @ameyasm1154 :: PR: #4532
- NeMo Megatron: Add sequence parallelism and selective activation checkpointing (rebased) by @ericharper :: PR: #4380
- Update megatron t5 interface to dialogue by @Zhilin123 :: PR: #4626
- Additional sentencepiece args - Byte fallback, split digits, split_on_whitespace by @MaximumEntropy :: PR: #4525
- Maximum sample-based training for Megatron NMT and Text Memmap based Seq2seq Pre-training by @MaximumEntropy :: PR: #4396
- NeMo Megatron Doc updates1 by @okuchaiev :: PR: #4633
- Asymmetric Encoder and Decoder Configuration for Megatron Models by @MaximumEntropy :: PR: #4568
- Add sentencepiece legacy arg to megatron tokenizer configs by @MaximumEntropy :: PR: #4659
- Megatron encode function with RPE fix by @MaximumEntropy :: PR: #4692
- Updates to NeMo Megatron OSS docs by @okuchaiev :: PR: #4709
- Changes to make Megatron NMT exportable by @Davood-M :: PR: #4499
- fix bug relating to ddp strategy in joint intent slot classification … by @Zhilin123 :: PR: #4762
- Fix qa notebook typos and branch by @ericharper :: PR: #4788
- Colab py37 compatibility megatron by @Zhilin123 :: PR: #4791
- added/fixed export for Megatron models by @Davood-M :: PR: #4712
- Fix providing glue in seq2seq eval by @MaximumEntropy :: PR: #4843
- Fix Megatron NMT consumed samples and ckpt_to_nemo split rank by @MaximumEntropy :: PR: #4884
- Fixing Megatron BERT output dimensions to [batch x sec x hidden] by @michalivne :: PR: #4894
- Prompt Learning Inference Improvements by @vadam5 :: PR: #4566
- MegaMolBART Compatibility by @michalivne :: PR: #4603
Text Normalization / Inverse Text Normalization
Changelog
- Add ITN pt by @guidefloripa :: PR: #4516
- add kw asr models, add itn ru checkpoint (tagger-based) by @bene-ges :: PR: #4595
- Fix ITN pt by @guidefloripa :: PR: #4623
- Bug fix hundred in Audio-based, added method so split text in sentences by @ekmb :: PR: #4610
- Fix itn pt time by @guidefloripa :: PR: #4630
- Pin lightning version to be < 1.7.0 by @MaximumEntropy :: PR: #4660
- G2P for OOV and heteronyms by @ekmb :: PR: #4624
- Publish pretrained itn t5 model for English by @bene-ges :: PR: #4748
- Added MLM Scoring by @yzhang123 :: PR: #4476
Export
Changelog
Bugfixes
Changelog
- Wrong order of returned tuple for general_collate_fn. by @XuesongYang :: PR: #4388
- Pitch, voiced_mask, prob_voiced have the same values which is not expected. by @XuesongYang :: PR: #4392
- Fix tarred dataset len when num shards is not divisible by workers by @itzsimpl :: PR: #4553
- Fix multiple dev/test datasets after restoring from checkpoint by @PeganovAnton :: PR: #4636
- Fix/need different cache dirs for different datasets by @PeganovAnton :: PR: #4640
- Improve mAES algorithm with patches by @titu1994 :: PR: #4662
General Improvements
Changelog
- Option to disable mp in VAD via num_workers=1 by @gkucsko :: PR: #4317
- Remove redundant bias expand by @xrennvidia :: PR: #4382
- Add option for specifying wandb save_dir from config by @shan18 :: PR: #4379
- Quick wav2vec fix. In-place operation adding convolutional positions … by @bonham79 :: PR: #4383
- Fixing import error in some cases by @borisfom :: PR: #4401
- Update with new conformer checkpoints. by @VahidooX :: PR: #4417
- Wav2vec fix by @bonham79 :: PR: #4467
- Relative Audio Paths by @stevehuang52 :: PR: #4470
- Allow Noam lr scheduler to run for more than max_steps by @alancucki :: PR: #4472
- Support for Different LRs with Param Groups by @stevehuang52 :: PR: #4508
- Fix runtime check by @borisfom :: PR: #4501
- Update finetune label models by @nithinraok :: PR: #4504
- Weighted bucketing by @tbartley94 :: PR: #4530
- Relative Audio Path by @stevehuang52 :: PR: #4520
- Fix duplex inference with grammars by @ekmb :: PR: #4517
- Add nsys profiling by @ericharper :: PR: #4539
- Remove the variable that is not used in the context. by @XuesongYang :: PR: #4547
- Adding multispeaker fastpitch and hifigan en model links to available… by @subhankar-ghosh :: PR: #4550
- Add length ratio filtering script by @MaximumEntropy :: PR: #4551
- Relative audio path in speech data explorer by @anteju :: PR: #4570
- Dividing generative question-answering CI tests by @ameyasm1154 :: PR: #4600
- Updating the default parameters in the example adapters config file by @shan18 :: PR: #4607
- Improve normalize_batch ValueError message by @piraka9011 :: PR: #4614
- Support listing Hugging Face model info by @titu1994 :: PR: #4619
- Update diarization data loader to train meeting data by @tango4j :: PR: #4567
- Fix HF check for model card info by @titu1994 :: PR: #4628
- Add Github Action for auto webpage build by @titu1994 :: PR: #4645
- Empty commit by @ti...
NVIDIA Neural Modules 1.10.0
Container
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.05
Known Issues
Issues
- Tutorial: Fastpitch_Training_GermanTTS.ipynb is experimental and still being tested.
ASR
Changelog
- Multilang asr tutorial by @bmwshop :: PR: #3931
- Add ASR with Adapters Tutorial by @titu1994 :: PR: #4149
- Add support for Decoder + Joint Adapters for ASR by @titu1994 :: PR: #4189
- updating PretrainedModelInfo and benchmark sheet for ASR models by @krishnacpuvvada :: PR: #4259
- Remove verbose flag from Dali Index Creator by @titu1994 :: PR: #4309
- updating PretrainedModelInfo for ASR SSL models by @krishnacpuvvada :: PR: #4292
- Adding docs for ASR SSL by @krishnacpuvvada :: PR: #4303
- Add ASR Scores to Docs by @titu1994 :: PR: #4412
- [ASR] Replace all paths with /content/ by @titu1994 :: PR: #4427
- added conformer mandarin model. by @VahidooX :: PR: #4201
- Runtime audio segment sampling for SSL by @krishnacpuvvada :: PR: #4126
TTS
Changelog
- [TTS] Add volume passthrough to fp for riva by @blisc :: PR: #4167
- Update TTS Configs from LAMB to AdamW by @redoctopus :: PR: #4233
- Add benchmark=false to all TTS configs by @redoctopus :: PR: #4263
- [TTS] add staticmethod decoration for BetaBinomialInterpolator by @XuesongYang :: PR: #4319
- [TTS] capture exception of non-supported windows. by @XuesongYang :: PR: #4320
- [TTS] enforced pin_memory = True by @XuesongYang :: PR: #4341
- [TTS] Training Fastpitch on German text and phonemes and finetuning HiFi-GAN on predicted mels by @aroraakshit :: PR: #4266
- IPA support for TTS by @redoctopus :: PR: #4310
- Bits of RADTTS support by @borisfom :: PR: #4343
NLP / NMT
Changelog
- Megatron NMT Restore from T5/BART and finetune by @MaximumEntropy :: PR: #3977
- Binarized memmap dataloader for Megatron NMT, Inference and checkpoint -> nemo by @MaximumEntropy :: PR: #4137
- Use unique names for temporary directories in punctuation and capitalization tests by @PeganovAnton :: PR: #4298
- Removes debug logging statements in Megatron NMT by @MaximumEntropy :: PR: #4312
- Raise error if trainer object is None for MegatronBaseModel by @MaximumEntropy :: PR: #4356
- Punctuation and capitalization tests race condition by @PeganovAnton :: PR: #4399
- unify intent slot dataset util functions in tutorials by @Zhilin123 :: PR: #4445
- Fix for TP=2,PP=2 decoding with megatron encoder-decoder models by @MaximumEntropy :: PR: #4484
- Add RETRO model for pretraining by @yidong72 :: PR: #4121
- Add async grad allreduce and chunk optimization by @xrennvidia :: PR: #4084
- Implements the UL2 Dataset and config by @MaximumEntropy :: PR: #4184
- Add RETRO indexed dataset and inference by @yidong72 :: PR: #4220
- Finetune T5 on the prefix-lm objective by @MaximumEntropy :: PR: #4328
- Fuse bias with geglu in ParallelMLP by @xrennvidia :: PR: #4213
- Support larger datasets for question answering by @Zhilin123 :: PR: #4205
- Refactor bias act fusion by @MaximumEntropy :: PR: #4376
- Prompt Learning Pipeline Parallel by @vadam5 :: PR: #4291
- Text memmap dataset by @michalivne :: PR: #4068
- Fuse grad division into async grad allreduce by @xrennvidia :: PR: #4327
Text Normalization / Inverse Text Normalization
Changelog
- [TN] WFST to normalize punctuation by @ekmb :: PR: #4108
- [TN/TTS] Add graph to tag IPA words/sentences in square brackets and leave them unchanged by @ekmb :: PR: #4323
- Tn tutorial by @yzhang123 :: PR: #4090
- [TN] WFST to normalize punctuation by @ekmb :: PR: #4108
- Tn add rules by @yzhang123 :: PR: #4302
- [TN/TTS] Add graph to tag IPA words/sentences in square brackets and leave them unchanged by @ekmb :: PR: #4323
- Tn install by @yzhang123 :: PR: #4055
- Fix electronic bug, new time ITN rule by @ekmb :: PR: #4355
- [TN] Bug fix: expand serial coverage of unknown symbol, remove constraints from word graph by @ekmb :: PR: #4463
- Configure T5 finetuning metrics by @MaximumEntropy :: PR: #4122
Export
Core
Changelog
General Improvements and Fixes
Changelog
- Update container to 22.05 by @ericharper :: PR: #4329
- Fix PTL step calculation by @titu1994 :: PR: #4307
- [NLP] P&C Fix multi node cache issue, add pynini guard by @ekmb :: PR: #4410
- NeMo Megatron GPT Unit Tests by @ericharper :: PR: #4099
- Add the PP2 GPT eval CI test by @yidong72 :: PR: #4168
- BigNLP perf regression fix by @MaximumEntropy :: PR: #4267
- Fixes for Megatron Base Model Artifacts by @MaximumEntropy :: PR: #4248
- Fix a wrong description in offline_diarization_with_asr.yaml by @tango4j :: PR: #4141
- bugfix for import error in Offline_ASR_with_VAD_for_CTC_models by @fayejf :: PR: #4424
- [Fix] ASR RNNT Tutorial by @stevehuang52 :: PR: #4352
- [TTS] Fix Hifigan finetune tutorial by @subhankar-ghosh :: PR: #4182
- [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. by @XuesongYang :: PR: #4432
- [bugfix][TTS] pitch, voiced_mask, prob_voiced have the same values. by @XuesongYang :: PR: #4435
- [TTS] [bugfix] German FastPitch HiFi-GAN tutorial and lr by @aroraakshit :: PR: #4459
- [TTS] [bugfix] update indentation by @aroraakshit :: PR: #4468
- Fix some 's' cases for IPA G2P by @redoctopus :: PR: #4460
- Fix ASR Typos in tutorials by @titu1994 :: PR: #4384
- Use unique names for temporary directories in punctuation and capitalization tests by @PeganovAnton :: PR: #4298
- Punctuation and capitalization tests race condition by @PeganovAnton :: PR: #4399
- Dialogue tasks unit test by @Zhilin123 :: PR: #4112
- fix error by @yzhang123 :: PR: #4120
- fix typo by @stevehuang52 :: PR: #4134
- Fix cmudict typo: phoneme YI1 -> IY1 in NVME by @redoctopus :: PR: #4139
- transcribe: scan directories recursively by @virajkarandikar :: PR: #4159
- Add 44KHz yaml file for Fastpitch training by @subhankar-ghosh :: PR: #4161
- [bugfix] consistent highfreq to both fastpitch and hifigan in their 44100 configs. by @XuesongYang :: PR: #4177
- Upperbound OmegaConf by @titu1994 :: PR: #4191
- Prompt tokenization bugfix by @vadam5 :: PR: #4197
- Updated to Prompt Learning Model to Use Distributed Sampler by @vadam5 :: PR: #4208
- Freesound fixes by @virajkarandikar :: PR: #4155
- Patch Hydra by @titu1994 :: PR: #4202
- Prompt Learning Model Saving Changes by @vadam5 :: PR: #4212
- Speakertasks manifest by @yzhang123 :: PR: #4185
- SSL Multi-loss Update by @sam1373 :: PR: #4186
- Support load_adapters with just adapter_name by @titu1994 :: PR: #4255
- Add special tokens to existing (trained) SentencePiece models by @aklife97 :: PR: #4203
- Fixing the speed slow-down for speech models. by @VahidooX :: PR: #4260
- Fix and add functions in speaker utils by @tango4j :: PR: #4138
- pt container 1.10->1.11.0 by @ekmb :: PR: #4273
- ssl fixes by @sam1373 :: PR: #4268
- Save Virtual Prompt Weights Only by @vadam5 :: PR: #4237
- add 'relative positional embedding (RPE)' feature - re-creating after… by @khcs :: PR: #4256
- Docs CSS: Update h4 tag style for the right side bar by @nickolyamba :: PR: #4284
- Fix Docs CSS: align docs left and increase width for large screens by @nickolyamba :: PR: #4154
- remove redundant condition for fastpitch. by @XuesongYang :: PR: #4281
- [Add] automaticly resolving relative audio path by @stevehuang52 :: PR: #4277
- forcing conv subsampling to 32 bit by @bmwshop :: PR: #4293
- Add library name and version when downloading from the Hugging Face Hub by @osanseviero :: PR: #4304
- clear access registry when adding if not empty by @sam1373 :: PR: #4306
- [collections] bugfix for capturing NotImplementedError of non-supported sup data types. by @XuesongYang :: PR: #4297
- Adjust lr for AdamW from LAMB default by @redoctopus :: PR: #4308
- Fix bugs in indexed dataset exam script by @yidong72 :: PR: #4325
- Torchaudio installation fix by @GNroy :: PR: #4330
- Speedup the speech commands dataset processing script by @shan18 :: PR: #4347
- fix wrong requirement by @yzhang123 :: PR: #4349
- Refactored path to manifest by @treacker :: PR: #4251
- Fix the post LN bug by @yidong72 :: PR: #4350
- [Fix] Hanging for Fully Randomized Bucketing by @stevehuang52 :: PR: #4348
- Auto-switch the input dimensions in the conformer encoder adapter to correct value by @shan18 :: PR: #4354
- Set headscale false by @MaximumEntropy :: PR: #4364
- Add wandb as dependency by @titu1994 :: PR: #4365
- Fix trainer.global_steps in WandB logging by @titu1994 :: PR: #4366
- Finetuning changes for BART by @MaximumEntropy :: PR: #4003
- Make position embedding expansion specific to a batch to avoid checkpoint size mismatches by @MaximumEntropy :: PR: #4357
- Correct support for dataclasses in default module dim by @titu1994 :: PR: #4372
- Fix no attribute 'pad_id' bug when pre-processing by @yidong72 :: PR: #4377
- Question answering bug fix by @Zhilin123 :: PR: #4381
- Docs for NeMo Adapters by @titu1994 :: PR: #4369
- Update NeMo docs by @titu1994 :: PR: #4397
- Fixing import error in some cases by @borisfom :: PR: #4402
- Fix tutorial typos and docs by @titu1994 :: PR: #4415
- Add reconfigure on validation epoch start by @MaximumEntropy :: PR: #4393
- Re-apply fixes from r1.9.0 by @redoctopus :: PR: #4425
- Fix...
NVIDIA Neural Modules 1.9.0
Container
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.04
ASR
Changelog
- Fix changed function name in offline vad asr notebeook by @fayejf :: PR: #4007
- NeMo Adapters Support + ASR Adapters by @titu1994 :: PR: #3942
- Update ASR configs with num_workers and pin_memory by @titu1994 :: PR: #4270
- Verbose k2 install, skip if failed by @GNroy :: PR: #4289
- Torch conversion for VAD-Diarization pipeline by @tango4j :: PR: #3930
- Multiprocess improvements by @nithinraok :: PR: #4127
TTS
Changelog
- Tn tts e by @ekmb :: PR: #3988
- Remove AudioToCharWithPriorAndPitchDataset dependency from fastpitch by @subhankar-ghosh :: PR: #4008
- Deprecation by @blisc :: PR: #4082
- FastPitch FT notebook - Improving Speech Quality clarifications by @redoctopus :: PR: #3954
NLP / NMT
Changelog
- Option to remove bias terms from Megatron transformers by @MaximumEntropy :: PR: #3973
- Add NMT method to translate with TN/ITN pre/post-processing by @MaximumEntropy :: PR: #4009
- Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
- Fix GPT model parallel eval by @yidong72 :: PR: #4054
- Updating with main by @jpilaul :: PR: #4073
- Cherry-pick fix for megatron ckpt conversion script when using BCP by @ericharper :: PR: #4089
- Check implicit grad acc in GLUE dataset building by @MaximumEntropy :: PR: #4123
- Fix/punctuation avoid overwritting tmp files by @PeganovAnton :: PR: #4144
- Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
- Raise error if bicleaner is not installed in NMT Data preprocesing notebook by @MaximumEntropy :: PR: #4264
- Fix epoch end for NeMo NMT by @MaximumEntropy :: PR: #4265
- Update YAML with trainer.benchmark=False for NLP by @MaximumEntropy :: PR: #4261
- Add NMT method to translate with TN/ITN pre/post-processing by @MaximumEntropy :: PR: #4009
- Continuous prompt refactor by @vadam5 :: PR: #3877
- T5 finetuning for generic small text-to-text datasets by @MaximumEntropy :: PR: #4032
Text Normalization / Inverse Text Normalization
Changelog
- Tn special text support by @yzhang123 :: PR: #3969
- Tn update numbers by @yzhang123 :: PR: #3992
- Tn tts e by @ekmb :: PR: #3988
- Itn vi by @yzhang123 :: PR: #4029
- Refactor tn data folder, and update of measure by @yzhang123 :: PR: #4028
- Remove conda dependency for tn by @yzhang123 :: PR: #4057
- Tn electronic by @yzhang123 :: PR: #4053
- ThutmoseTaggerModel, a new model for inverse text normalization by @bene-ges :: PR: #4011
- Tutorial on ITN with Thutmose tagger and small fixes by @bene-ges :: PR: #4117
- Cleaned up TN/ ITN doc by @yzhang123 :: PR: #4119
- Update default for SH by @ekmb :: PR: #4135
- Update ContextNet version by @titu1994 :: PR: #4207
NeMo Tools
NeMo Core
Changelog
- Support pre-extracted nemo checkpoint for restoration by @titu1994 :: PR: #4061
- Fix type checking to be compatible with named tuples by @artbataev :: PR: #3986
- Update num worker calculation due to PTL flag changes by @redoctopus :: PR: #4056
- Refresh NeMo documentation to Sphinx Book Theme by @titu1994 :: PR: #3996
- Generalize adapter merge strategy for future adapters by @titu1994 :: PR: #4091
General Improvements
Changelog
- Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
- Fix restoring from checkpoint for case when is provided by @PeganovAnton :: PR: #4136
- Fix/punctuation avoid overwritting tmp files by @PeganovAnton :: PR: #4144
- Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
- Ability to set log_prediction to false by @bmwshop :: PR: #3929
- Glu activation variants by @MaximumEntropy :: PR: #3951
- Ranking merge by @yzhang123 :: PR: #3906
- Fix path in doc by @nithinraok :: PR: #3979
- Adding fisher audio conversion script from old NeMo branch by @jbalam-nv :: PR: #3991
- improvements to geet_commonvoice_data script by @bmwshop :: PR: #3999
- Bugfix and variable name change for clustering code by @tango4j :: PR: #4023
- Exp manager log rank 0 only arguments by @MaximumEntropy :: PR: #4026
- Force import test on PR by @titu1994 :: PR: #4037
- Drop support for kaldi-io by @titu1994 :: PR: #4042
- Cherry pick HF integration and bug fixes from 1.8.1 by @ericharper :: PR: #4052
- Make saving prompt encoder embeddings non-configurable by @vadam5 :: PR: #4071
- Replace sampled tokens with EOD after EOD has been sampled once by @vadam5 :: PR: #4070
- Added answer only loss for prompt learning by @vadam5 :: PR: #4069
- added stacking suport to conformer. by @VahidooX :: PR: #4045
- Update LJSpeech whitelist file path by @redoctopus :: PR: #4078
- Added check for microbatch calculator by @vadam5 :: PR: #4043
- Prompt Learning Docs by @vadam5 :: PR: #4046
- Fix link to prompt tuning page by @SeanNaren :: PR: #4081
- Add docs for by @titu1994 :: PR: #4079
- Dialogue task by @Zhilin123 :: PR: #3884
- RMSNorm, Normformer and fixes from merging 1.8.0 into main by @MaximumEntropy :: PR: #4048
- Correct link to PTL by @titu1994 :: PR: #4088
- Added encoder and decoder modules for RETRO model by @yidong72 :: PR: #4038
- Upgrade container to NGC PyTorch 22.04 by @ericharper :: PR: #4085
- Tarred fix label models by @nithinraok :: PR: #4092
- Fix link to tutorial in dialogue docs by @Zhilin123 :: PR: #4093
- Prompt learning Notebook by @vadam5 :: PR: #4031
- Add more papers by @yzhang123 :: PR: #4097
- Ignore speakers with few utterances by @nithinraok :: PR: #3722
- Access mixin by @sam1373 :: PR: #4098
- Add CharParser for Cyrillic letters by @karpov-nick :: PR: #4101
- Restored tests previously disabled for 22.03 base by @borisfom :: PR: #4109
- Add augmentation to label models by @nithinraok :: PR: #4113
- Fix register artifacts by @ramanathan831 :: PR: #4116
- Fix typo by @yzhang123 :: PR: #4140
- bug_fix_diarization_manifest_creation by @yzhang123 :: PR: #4125
- Tacotron2 retrain by @treacker :: PR: #4103
- WaveGlow input type fixes by @redoctopus :: PR: #4151
- Notebooks' link, typo and import fix by @fayejf :: PR: #4158
- Thutmose tagger bug fixes by @bene-ges :: PR: #4162
- Update speaker docs by @nithinraok :: PR: #4164
- Set plugin to None when no apex by @ekmb :: PR: #4171
- Fix doc by @yzhang123 :: PR: #4152
- Small import name fix by @fayejf :: PR: #4180
- Rename folder VAD -> vad by @fayejf :: PR: #4163
- Fix the server key value problem in the notebook by @yidong72 :: PR: #4196
- Pin omegaconf for r1.9.0 by @ericharper :: PR: #4195
- Fix cherrypicks by @titu1994 :: PR: #4204
- Fix bugs for dialogue tutorial by @Zhilin123 :: PR: #4211
- Tacotron2 1.9.0 bugfixes by @redoctopus :: PR: #4209
- Add docs for Thutmose Tagger by @bene-ges :: PR: #4173
- Dialogue tutorial fix by @Zhilin123 :: PR: #4221
- Fix syntax error in ipynb-file by @bene-ges :: PR: #4228
- Fix JSON serialization problem by @yidong72 :: PR: #4235
- Prompt Learning Typo Fixes by @vadam5 :: PR: #4238
- Fixing bug 3642622 by @pasandi20 :: PR: #4250
- Fix broken link in the tutorial by @bene-ges :: PR: #4257
- Prompt learning notebook bugfix by @vadam5 :: PR: #4262
- Fix missing validation dataset, whitelist certain keywords for datasets by @titu1994 :: PR: #4269
- Set Save on train end to false by @vadam5 :: PR: #4274
- Updated config to fix CI test OOM error by @vadam5 :: PR: #4279
- Changed total virtual prompt tokens by @vadam5 :: PR: #4295
NVIDIA Neural Modules 1.8.2
Known Issues
- Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.
TTS
- Fastpitch Tutorial fix by @subhankar-ghosh :: PR: #4044
NVIDIA Neural Modules 1.8.1
Known Issues
- Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.
TTS
- Restore_buffer bug fix and update NeMo checkpoint URL by @subhankar-ghosh :: PR: #4041
Hugging Face Hub Integration
Bug Fixes
NVIDIA Neural Modules 1.8.0
Known Issues
Issues
- Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.
- pytest for Vietnamese inverse text normalization are failing. Fixed in main
Container
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.03
ASR
Changelog
- ASR SSL Update by @sam1373 :: PR: #3714
- Polylang asr by @bmwshop :: PR: #3721
- Test grad accumulation for RNNT loss by @titu1994 :: PR: #3731
- Add readme files describing model execution flow for ASR tasks by @titu1994 :: PR: #3812
- add fr asr ckpt to doc by @yzhang123 :: PR: #3809
- Fix asr tests in 22.02 by @titu1994 :: PR: #3823
- Add new pretrained Spanish ASR models by @erastorgueva-nv :: PR: #3830
- Documentation updates for ASR by @titu1994 :: PR: #3846
- Offline VAD+ASR tutorial by @fayejf :: PR: #3828
- Added Hindi and Marathi Models in Nemo pretrained ASR_CTC_BPE models … by @meghmak13 :: PR: #3856
- Add a missing line to ASR_with_NeMo.ipynb by @lifefeel :: PR: #3908
- Multilang asr models by @bmwshop :: PR: #3907
- added stt_en_conformer_transducer_large_ls to NGC by @VahidooX :: PR: #3920
- Fix DALI test on 22.03 by @titu1994 :: PR: #3911
- Adding RNN encoder for LSTM-Transducer and LSTM-CTC models by @VahidooX :: PR: #3886
- Fix issue with Segfault in ASR models by @titu1994 :: PR: #3956
- Added Mandarin pretrained Conformer-Transducer-Large model trained on AISHELL2. by @VahidooX :: PR: #3970
TTS
Changelog
- Bump TTS deprecation version to 1.9 by @blisc :: PR: #3955
- Add pinned pynini and scipy installs to TTS training tutorial by @redoctopus :: PR: #3967
- Compatability override to load_state_dict for old TTS checkpoints by @redoctopus :: PR: #3978
NLP / NMT
Changelog
- Use worker processes for data preprocessing by @crcrpar :: PR: #3665
- Set find_unused_parameters to False in GPT example script by @ericharper :: PR: #3837
- GPT multinode eval by @ericharper :: PR: #3821
- Fix MegatronPretrainingRandomSampler by taking into account by @crcrpar :: PR: #3826
- Add slot filling into DST Generative model by @Zhilin123 :: PR: #3695
- Disable nvfuser for gpt by @ericharper :: PR: #3845
- Multi-Label Joint Intent Slot Classification by @chenrichard10 :: PR: #3742
- fix bug in intent/slot model reloading by @carolmanderson :: PR: #3874
- Make test_gpt_eval unit test less strict by @yidong72 :: PR: #3898
- Comment gpt resume ci test by @MaximumEntropy :: PR: #3901
- Neural Machine Translation with Megatron Transformer Models (Tensor Parallel and Tarred Datasets Only) by @MaximumEntropy :: PR: #3861
- Megatron support by @ramanathan831 :: PR: #3893
- Populate the GPT/BERT with uploaded models by @yidong72 :: PR: #3885
- Megatron BART by @michalivne :: PR: #3666
- Additional Japanese processor for NMT that uses MeCab segmentation. Fix for BLEU in one-many NMT by @MaximumEntropy :: PR: #3889
- NMT GRPC sever URL fix by @MaximumEntropy :: PR: #3918
- Megatron legacy conversion support by @ramanathan831 :: PR: #3919
- Update max_epochs on megatron configs by @ericharper :: PR: #3958
- Fix NMT variable passing bug by @aklife97 :: PR: #3985
- Fix nemo megatron restore with artifacts by @ericharper :: PR: #3997
- Fix megatron notebook by @ramanathan831 :: PR: #4004
- Megatron work-arounds by @borisfom :: PR: #3998
- Add T5 model P-tuning support by @yidong72 :: PR: #3768
- Make index mappings dir configurable by @ericharper :: PR: #3868
- T5 pipeline parallel by @MaximumEntropy :: PR: #3750
Text Normalization / Inverse Text Normalization
Changelog
Export
Changelog
Bugfixes
General Improvements
Changelog
- Pynini pip by @yzhang123 :: PR: #3726
- upgrade PTL trainer flags by @nithinraok :: PR: #3589
- Updated Speech Data Explorer by @vsl9 :: PR: #3710
- Fix spelling error in num_workers parameter to actually set number of dataset workers specified in yaml configs by @themikem :: PR: #3800
- Support for Camembert Huggingface bert-like models by @itzsimpl :: PR: #3799
- Update to 22.02 by @ericharper :: PR: #3771
- Fixing the defaults of conformer models in the config files by @VahidooX :: PR: #3836
- Fix T5 Encoder Mask while decoding by @MaximumEntropy :: PR: #3838
- fix: multilingual transcribe does not require lang id param by @bmwshop :: PR: #3833
- Misc improvements by @titu1994 :: PR: #3843
- Change container by @MaximumEntropy :: PR: #3844
- Making gender assignment random for cardinals, fractions, and decimal… by @bonham79 :: PR: #3759
- Jenkinsfile test changes by @chenrichard10 :: PR: #3879
- Adding a RegEx tokenizers by @michalivne :: PR: #3839
- enable bias+dropout+add fusion with nvfuser at inference by @erhoo82 :: PR: #3869
- Add text_generation_util to support TopK, TopP sampling + Tabular Data Generation. by @yidong72 :: PR: #3834
- Ptl requirements bound by @MaximumEntropy :: PR: #3903
- doc links update by @ekmb :: PR: #3891
- add citations by @yzhang123 :: PR: #3902
- Update NeMo CI to 22.03 by @MaximumEntropy :: PR: #3900
- Add domain groups to changelog builder by @titu1994 :: PR: #3904
- add input threshhold by @yzhang123 :: PR: #3913
- improvements to commonvoice data script by @bmwshop :: PR: #3892
- fixes to the cleanup flag by @bmwshop :: PR: #3921
- Upgrade to PTL 1.6.0 by @ericharper :: PR: #3890
- JSON output from diarization now includes sentences. Optimized senten… by @demsarjure :: PR: #3897
- Stateless timer fix for PTL 1.6 by @MaximumEntropy :: PR: #3925
- fix save_best missing chpt bug, update for setup_tokenizer() changes by @ekmb :: PR: #3932
- Fix tarred sentence dataset length by @MaximumEntropy :: PR: #3941
- remove old doc by @ekmb :: PR: #3946
- Fix issues with librosa deprecations by @titu1994 :: PR: #3950
- Fix notebook bugs for branch r1.8.0 by @yidong72 :: PR: #3948
- Fix global batch fit loop by @ericharper :: PR: #3936
- Refactor restorefrom by @ramanathan831 :: PR: #3927
- Fix variable name and move models to CPU in Change partition by @aklife97 :: PR: #3972
- Fix notebook error by @yidong72 :: PR: #3975
- Notebook Bug Fixes for r1.8.0 by @vadam5 :: PR: #3989
- Fix compat override for TalkNet Aligner by @redoctopus :: PR: #3993
- docs fixes by @ekmb :: PR: #3987
- Fixes val_check_interval, skip loading train data during eval by @MaximumEntropy :: PR: #3968
- LogProb calculation performance fix by @yidong72 :: PR: #3984
- Fix P-Tune T5 model by @yidong72 :: PR: #4001
- Fix the broadcast shape mismatch by @yidong72 :: PR: #4017
- Add known issues to notebook by @ericharper :: PR: #4024
NVIDIA Neural Modules 1.7.2
GPT Bugfixes
- GPT dataloader improvements and fixes by @crcrpar :: PRs #3826 , #3665
- Disable nvfuser by @ericharper :: PR #3845
- Set find_unused_parameters to False by @ericharper :: PR #3837
T5 XNLI Example
NVIDIA Neural Modules 1.7.1
Known Issues
- find_unused_parameters should be False when training GPT: #3837
Bugfixes
- revert changes by @yzhang123 :: PR: #3785
- Fixed soft prompt eval loading bug by @vadam5 :: PR: #3805
- mT5 whole word masking and T5 finetuning config fixes by @MaximumEntropy :: PR: #3776
- Raise error if FP16 training is tried with O2 recipe. by @ericharper :: PR: #3806
NVIDIA Neural Modules 1.7.0
Known Issues
- Megatron GPT training with O2 and FP16 is bugged. FP16 and O1 still works.
- find_unused_parameters should be False when training GPT: #3837
- FastPitch training may result in stalled GPUs. Users will have to manually kill their runs and continue training from the latest checkpoint.
- mT5 issue with whole word masking, see #3776
- T5 finetuning config issue, see #3776
Container
NOTE: From NeMo 1.7.0 onwards, NeMo containers will follow the YY.MM conversion for naming, where the YY.MM value is based on the base container. For additional information regarding NeMo containers, please visit : https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.01
ASR
- Wav2vec by @tbartley94 :: PR: #3297
- Fix bug in multi-checkpoint loading by @sam1373 :: PR: #3536
- Add HuggingFace Datasets to NeMo ASR Dataset script by @titu1994 :: PR: #3513
- Add support for Gradient Clipping (clamp) in RNNT Numba loss by @titu1994 :: PR: #3550
- Enable Tarred Dataset Support for NVIDIA DALI by @titu1994 :: PR: #3485
- Add initial support for Buffered RNNT Scripts by @titu1994 :: PR: #3602
- Significantly speed up RNNT loss on CUDA by @titu1994 :: PR: #3653
- Fixing the bug in the stateful rnnt decoder. by @VahidooX :: PR: #3673
- Add Buffered RNNT with LCS Merge algorithm by @titu1994 :: PR: #3669
- Asr noise data scripts by @jbalam-nv :: PR: #3660
- ASR SSL update by @sam1373 :: PR: #3746
- Add randomized bucketing by @VahidooX :: PR: #3445
- Self-supervised tutorial & update by @sam1373 :: PR: #3344
- Updated conformer models. by @VahidooX :: PR: #3741
- Added speaker identification script with cosine and neural classifier… by @nithinraok :: PR: #3672
- Fix in clustering diarizer by @nithinraok :: PR: #3701
- Add a function that writes cluster label in diarization pipeline by @tango4j :: PR: #3643
TTS
- port UnivNet to NeMo TTS collection by @L0SG :: PR: #3186
- E2E TTS fixes by @redoctopus :: PR: #3508
- New structure for TTS datasets in scripts/dataset_processing, VocoderDataset, update TTSDataset by @Oktai15 :: PR: #3484
- Depreciate some TTS models and TTS datasets by @Oktai15 :: PR: #3576
- Fix bugs in HiFi-GAN (scheduler, optimizers) and add input_example() in Mixer-TTS/Mixer-TTS-X by @Oktai15 :: PR: #3564
- Update UnivNet, HiFi-GAN and WaveGlow, small fixes in Mixer-TTS, FastPitch and Exportable by @Oktai15 :: PR: #3585
- Fix typo in FastPitch config (pitch_avg -> pitch_mean) by @eyentei :: PR: #3593
- Fix incorrect usage of TTSDataset in some files and fix one-line bug in NVIDIA's CMUDict by @Oktai15 :: PR: #3594
- Convert entry from UTF-16 to UTF-8 by @redoctopus :: PR: #3597
- remove CheckInstall by @blisc :: PR: #3577
- Fix UnivNet LibriTTS pretrained location by @m-toman :: PR: #3615
- FastPitch training tutorial by @subhankar-ghosh :: PR: #3631
- Update Aligner, add new methods to AlignmentEncoder by @Oktai15 :: PR: #3641
- Add Mixed Representation Training by @blisc :: PR: #3473
- Add speakerID to libritts/get_data.py by @subhankar-ghosh :: PR: #3662
- Update TTS tutorials, Simplification of testing Mixer-TTS and FastPitch by @Oktai15 :: PR: #3680
- Clean FastPitch_Finetuning.ipynb notebook by @Oktai15 :: PR: #3698
- Add cache_size to BetaBinomialInterpolator, fix bugs in TTS tutorials and FastPitch by @Oktai15 :: PR: #3706
- Fix bugs in VocoderDataset and TTSDataset by @Oktai15 :: PR: #3713
- Fix bugs in E2E TTS, Mixer-TTS and FastPitch by @Oktai15 :: PR: #3740
NLP / NMT
- NLPDDPPlugin find_unused_parameters is configurable by @mlgill :: PR: #3478
- Megatron encoder-decoder refactor by @michalivne :: PR: #3542
- Finetuning NeMo Megatron T5 Models on GLUE by @MaximumEntropy :: PR: #3408
- Pipeline parallelism for GPT by @ericharper :: PR: #3388
- Generalized the P-tuning method to support various NLP tasks by @yidong72 :: PR: #3623
- Megatron_LM checkpoint to NeMo checkpoint support by @yidong72 :: PR: #3692
- Bugfix for GPT eval by @ericharper :: PR: #3744
- Yuya/megatron t5 glue eval by @yaoyu-33 :: PR: #3751
- Enforce legacy tokenizer for sentencepiece to add special tokens for T5 by @MaximumEntropy :: PR: #3457
- Added P-Tuning method by @yidong72 :: PR: #3488
- O2 style mixed precision training for T5 by @MaximumEntropy :: PR: #3664
- LM adapted T5 dataset by @MaximumEntropy :: PR: #3654
- Fix consumed samples calculation + PTune Model bugs by @yidong72 :: PR: #3738
- Add pipeline support to eval methods by @ericharper :: PR: #3684
- XNli benchmark by @yidong72 :: PR: #3693
- Refactor dialogue state tracking for modelling/dataset interoperability by @Zhilin123 :: PR: #3526
- Changes to support mean n-gram size masking for T5 by @MaximumEntropy :: PR: #3646
- Dialogue state tracking refactor by @Zhilin123 :: PR: #3667
- Parallel prompt tuning by @vadam5 :: PR: #3670
- GEGLU activation for T5 by @MaximumEntropy :: PR: #3694
Text Normalization / Inverse Text Normalization
- Text normalization takes too much time for a string which contains a lot of dates by @PeganovAnton :: PR: #3451
- ITN bug fixes, ip address, card num support, whitelist clean up by @ekmb :: PR: #3574
- Fix tn bugs by @yzhang123 :: PR: #3580
- add serial number to itn by @yzhang123 :: PR: #3584
- ITN: SH bug fixes for telephone by @ekmb :: PR: #3592
- Tn bug 1.7.0 by @yzhang123 :: PR: #3730
- TN docs update by @ekmb :: PR: #3735
Export
- Update UnivNet, HiFi-GAN and WaveGlow, small fixes in Mixer-TTS, FastPitch and Exportable by @Oktai15 :: PR: #3585
- Conformer onnx fix by @borisfom :: PR: #3524
- Add onnx support for speaker models by @nithinraok :: PR: #3650
- Jasper mask/export fix by @borisfom :: PR: #3691
Bugfixes
- Text normalization takes too much time for a string which contains a lot of dates by @PeganovAnton :: PR: #3451
- Dialogue state tracking refactor/ SGDGEN patch 2 by @Zhilin123 :: PR: #3674
- lower bound PTL to 1.5.10 and remove last ckpt patch fix by @nithinraok :: PR: #3690
Improvements
- Wfst tutorial by @tbartley94 :: PR: #3479
- Update CMUdict with ADLR version pronunciations by @redoctopus :: PR: #3446
- Fix docs by @yzhang123 :: PR: #3523
- Add docstring to UnivNetModel by @L0SG :: PR: #3529
- Increase lower bound due to security vulnerability by @ericharper :: PR: #3537
- Add Change Log builder to NeMo by @titu1994 :: PR: #3527
- Bugfix, need to freeze the model by @yidong72 :: PR: #3540
- Bucketing quick fix by @tbartley94 :: PR: #3543
- More fixes to SentencePiece for T5 by @MaximumEntropy :: PR: #3515
- Update CONTRIBUTING.md by @Oktai15 :: PR: #3569
- Update pr template and re-add Changelog builder by @titu1994 :: PR: #3575
- Apex quick fix by @ekmb :: PR: #3591
- Upgrade to 22.01 container by @ericharper :: PR: #3571
- Fix typo and update minimal version of scipy by @Oktai15 :: PR: #3604
- Add env variable to force transformers to run offline during CI by @ericharper :: PR: #3607
- Correctly install NeMo wheel by @titu1994 :: PR: #3599
- Fix wheel build by @titu1994 :: PR: #3610
- Fixed EH and error reporting in restore_from by @borisfom :: PR: #3583
- Clarifying documentation by @itzsimpl :: PR: #3616
- Improve docs for finetuning by @titu1994 :: PR: #3622
- Add NeMo version to all new .nemo files by @titu1994 :: PR: #3605
- Update numba if NVIDIA_PYTORCH_VERSION not correct by @itzsimpl :: PR: #3614
- Remove @experimental decorator in diarization related files. by @tango4j :: PR: #3625
- Remove compression from .nemo files by @okuchaiev :: PR: #3626
- Update adobe analytics by @ericharper :: PR: #3645
- Add ssl tutorial to tutorial docs page by @sam1373 :: PR: #3649
- Fix number of channels>1 issue by @ekmb :: PR: #3652
- Fixed the bug in bucketing. by @VahidooX :: PR: #3663
- Adding guard by @yzhang123 :: PR: #3655
- Add tutorial paths by @titu1994 :: PR: #3651
- Folder name update by @ekmb :: PR: #3671
- Test HF online for SGD-GEN only by @MaximumEntropy :: PR: #3681
- Update Librosa support to 0.9 by @titu1994 :: PR: #3682
- Comment out numba in 22.01 release by @titu1994 :: PR: #3685
- Fix failing tests inside of the 22.01 container in PR 3571 by @fayejf :: PR: #3609
- Fixed Apex guard when imported classes are used for default values by @michalivne :: PR: #3700
- Update citrinet_512.yaml by @Jorjeous :: PR: #3642
- update torchaudio in Dockerfile to match torch version by @GNroy :: PR: #3637
- Enforce import tests on the three domains by @titu1994 :: PR: #3702
- Audio based norm speed up by @ekmb :: PR: #3703
- Fix device on notebook by @titu1994 :: PR: #3732
- pynini pip by @yzhang123 :: PR: #3729
- Removed fp16 converting in complete method by @dimapihtar :: PR: #3709
- Mirror AN4 while CMU servers are down by @titu1994 :: PR: #3743
- Fix SSL configs for 1.7 by @sam1373 :: PR: #3748
- Punct process bug fix by @ekmb :: PR: #3747
- Specify gpus in SSL notebook by @sam1373 :: PR: #3753
- Duplex model inference fix, money encoder fix by @ekmb :: PR: #3754
- Update decoding strategy docs and override general value for tutorials by @titu1994 :: PR: #3755
- Fix directories in ssl notebook by @sam1373 :: PR: #3758
- Update Tacotron2_Training.ipynb by @blisc :: PR: #3769
- Fix dockerfile by @yzhang123 :: PR: #3778
- Prompt-Tuning-Documentation by @vadam5 :: PR: #3777
- Prompt tuning bug fix by @vadam5 :: PR: #3780
NVIDIA Neural Modules 1.6.2
Bug fix
- Changed Apex not found error to warning to enable NLP models which aren't apex dependent when Apex isn't installed.