Skip to content

Releases: NVIDIA/NeMo

NVIDIA Neural Modules 1.11.0

08 Sep 17:06
Compare
Choose a tag to compare

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.07

ASR

Changelog
  • Add ASR CTC Decoding module by @titu1994 :: PR: #4342
  • Fixing bugs in calling method ctc_decoder_predictions_tensor. by @VahidooX :: PR: #4414
  • Fixed WER initialization in ASR_with_Nemo notebook by @anteju :: PR: #4523
  • Update signature of Hypothesis alignments by @titu1994 :: PR: #4511
  • Add support for ASR Adapter Auxiliary Losses by @titu1994 :: PR: #4480
  • Catalan ASR NGC Resource by @stevehuang52 :: PR: #4576
  • Add kw asr models, add itn ru checkpoint (tagger-based) by @bene-ges :: PR: #4595
  • Add DALI char dataset support to SSL model by @piraka9011 :: PR: #4592
  • Customize arguments for trimming the leading/trailing silence by @XuesongYang :: PR: #4582
  • Update Offline ASR with CTC Decoding by @titu1994 :: PR: #4608
  • Add Squeezeformer to ASR by @titu1994 :: PR: #4416
  • Fix ASR notebooks by @titu1994 :: PR: #4738
  • Add pretrained ASR models for Croatian by @anteju :: PR: #4682
  • Dataloader, collector, loss and metric for multiscale diarization decoder by @tango4j :: PR: #4187
  • Multilingual VAD model by @fayejf :: PR: #4734
  • Adding support for models trained with full context for cache-aware streaming. by @VahidooX :: PR: #4687
  • Fp16 support for Conformer by @bmwshop :: PR: #4571
  • Tiny VAD refactoring for postprocessing by @fayejf :: PR: #4625
  • Add silence handling for speaker diarization pipeline by @nithinraok :: PR: #4512
  • Add Bucketing support to TarredAudioToClassificationLabelDataset by @entn-at :: PR: #4465

TTS

Changelog

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog

Export

Changelog

Bugfixes

Changelog
  • Wrong order of returned tuple for general_collate_fn. by @XuesongYang :: PR: #4388
  • Pitch, voiced_mask, prob_voiced have the same values which is not expected. by @XuesongYang :: PR: #4392
  • Fix tarred dataset len when num shards is not divisible by workers by @itzsimpl :: PR: #4553
  • Fix multiple dev/test datasets after restoring from checkpoint by @PeganovAnton :: PR: #4636
  • Fix/need different cache dirs for different datasets by @PeganovAnton :: PR: #4640
  • Improve mAES algorithm with patches by @titu1994 :: PR: #4662

General Improvements

Changelog
Read more

NVIDIA Neural Modules 1.10.0

01 Jul 22:14
Compare
Choose a tag to compare

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.05

Known Issues

Issues
  • Tutorial: Fastpitch_Training_GermanTTS.ipynb is experimental and still being tested.

ASR

Changelog

TTS

Changelog

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog
  • [TN] WFST to normalize punctuation by @ekmb :: PR: #4108
  • [TN/TTS] Add graph to tag IPA words/sentences in square brackets and leave them unchanged by @ekmb :: PR: #4323
  • Tn tutorial by @yzhang123 :: PR: #4090
  • [TN] WFST to normalize punctuation by @ekmb :: PR: #4108
  • Tn add rules by @yzhang123 :: PR: #4302
  • [TN/TTS] Add graph to tag IPA words/sentences in square brackets and leave them unchanged by @ekmb :: PR: #4323
  • Tn install by @yzhang123 :: PR: #4055
  • Fix electronic bug, new time ITN rule by @ekmb :: PR: #4355
  • [TN] Bug fix: expand serial coverage of unknown symbol, remove constraints from word graph by @ekmb :: PR: #4463
  • Configure T5 finetuning metrics by @MaximumEntropy :: PR: #4122

Export

Changelog

Core

Changelog

General Improvements and Fixes

Changelog
Read more

NVIDIA Neural Modules 1.9.0

03 Jun 20:40
Compare
Choose a tag to compare

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.04

ASR

Changelog
  • Fix changed function name in offline vad asr notebeook by @fayejf :: PR: #4007
  • NeMo Adapters Support + ASR Adapters by @titu1994 :: PR: #3942
  • Update ASR configs with num_workers and pin_memory by @titu1994 :: PR: #4270
  • Verbose k2 install, skip if failed by @GNroy :: PR: #4289
  • Torch conversion for VAD-Diarization pipeline by @tango4j :: PR: #3930
  • Multiprocess improvements by @nithinraok :: PR: #4127

TTS

Changelog

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog

NeMo Tools

Changelog
  • Added exception handling for audio player in SDE by @vsl9 :: PR: #4077

NeMo Core

Changelog
  • Support pre-extracted nemo checkpoint for restoration by @titu1994 :: PR: #4061
  • Fix type checking to be compatible with named tuples by @artbataev :: PR: #3986
  • Update num worker calculation due to PTL flag changes by @redoctopus :: PR: #4056
  • Refresh NeMo documentation to Sphinx Book Theme by @titu1994 :: PR: #3996
  • Generalize adapter merge strategy for future adapters by @titu1994 :: PR: #4091

General Improvements

Changelog

NVIDIA Neural Modules 1.8.2

26 Apr 21:29
Compare
Choose a tag to compare

Known Issues

  • Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.

TTS

NVIDIA Neural Modules 1.8.1

22 Apr 05:06
2ef2892
Compare
Choose a tag to compare

Known Issues

  • Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.

TTS

Hugging Face Hub Integration

Bug Fixes

NVIDIA Neural Modules 1.8.0

20 Apr 04:29
Compare
Choose a tag to compare

Known Issues

Issues
  • Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.
  • pytest for Vietnamese inverse text normalization are failing. Fixed in main

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.03

ASR

Changelog

TTS

Changelog
  • Bump TTS deprecation version to 1.9 by @blisc :: PR: #3955
  • Add pinned pynini and scipy installs to TTS training tutorial by @redoctopus :: PR: #3967
  • Compatability override to load_state_dict for old TTS checkpoints by @redoctopus :: PR: #3978

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog

Export

Changelog

Bugfixes

Changelog

General Improvements

Changelog

NVIDIA Neural Modules 1.7.2

17 Mar 22:35
c16b894
Compare
Choose a tag to compare

GPT Bugfixes

T5 XNLI Example

NVIDIA Neural Modules 1.7.1

08 Mar 03:04
d5ad011
Compare
Choose a tag to compare

Known Issues

  • find_unused_parameters should be False when training GPT: #3837

Bugfixes

NVIDIA Neural Modules 1.7.0

02 Mar 00:57
256236f
Compare
Choose a tag to compare

Known Issues

  • Megatron GPT training with O2 and FP16 is bugged. FP16 and O1 still works.
  • find_unused_parameters should be False when training GPT: #3837
  • FastPitch training may result in stalled GPUs. Users will have to manually kill their runs and continue training from the latest checkpoint.
  • mT5 issue with whole word masking, see #3776
  • T5 finetuning config issue, see #3776

Container

NOTE: From NeMo 1.7.0 onwards, NeMo containers will follow the YY.MM conversion for naming, where the YY.MM value is based on the base container. For additional information regarding NeMo containers, please visit : https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.01

ASR

TTS

  • port UnivNet to NeMo TTS collection by @L0SG :: PR: #3186
  • E2E TTS fixes by @redoctopus :: PR: #3508
  • New structure for TTS datasets in scripts/dataset_processing, VocoderDataset, update TTSDataset by @Oktai15 :: PR: #3484
  • Depreciate some TTS models and TTS datasets by @Oktai15 :: PR: #3576
  • Fix bugs in HiFi-GAN (scheduler, optimizers) and add input_example() in Mixer-TTS/Mixer-TTS-X by @Oktai15 :: PR: #3564
  • Update UnivNet, HiFi-GAN and WaveGlow, small fixes in Mixer-TTS, FastPitch and Exportable by @Oktai15 :: PR: #3585
  • Fix typo in FastPitch config (pitch_avg -> pitch_mean) by @eyentei :: PR: #3593
  • Fix incorrect usage of TTSDataset in some files and fix one-line bug in NVIDIA's CMUDict by @Oktai15 :: PR: #3594
  • Convert entry from UTF-16 to UTF-8 by @redoctopus :: PR: #3597
  • remove CheckInstall by @blisc :: PR: #3577
  • Fix UnivNet LibriTTS pretrained location by @m-toman :: PR: #3615
  • FastPitch training tutorial by @subhankar-ghosh :: PR: #3631
  • Update Aligner, add new methods to AlignmentEncoder by @Oktai15 :: PR: #3641
  • Add Mixed Representation Training by @blisc :: PR: #3473
  • Add speakerID to libritts/get_data.py by @subhankar-ghosh :: PR: #3662
  • Update TTS tutorials, Simplification of testing Mixer-TTS and FastPitch by @Oktai15 :: PR: #3680
  • Clean FastPitch_Finetuning.ipynb notebook by @Oktai15 :: PR: #3698
  • Add cache_size to BetaBinomialInterpolator, fix bugs in TTS tutorials and FastPitch by @Oktai15 :: PR: #3706
  • Fix bugs in VocoderDataset and TTSDataset by @Oktai15 :: PR: #3713
  • Fix bugs in E2E TTS, Mixer-TTS and FastPitch by @Oktai15 :: PR: #3740

NLP / NMT

Text Normalization / Inverse Text Normalization

Export

Bugfixes

  • Text normalization takes too much time for a string which contains a lot of dates by @PeganovAnton :: PR: #3451
  • Dialogue state tracking refactor/ SGDGEN patch 2 by @Zhilin123 :: PR: #3674
  • lower bound PTL to 1.5.10 and remove last ckpt patch fix by @nithinraok :: PR: #3690

Improvements

NVIDIA Neural Modules 1.6.2

05 Feb 06:09
Compare
Choose a tag to compare

Bug fix

  • Changed Apex not found error to warning to enable NLP models which aren't apex dependent when Apex isn't installed.