-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Megatron KERPLE positional embeddings #6478
Commits on Apr 17, 2023
-
[TTS] FastPitch adapter fine-tune and conditional layer normalization (…
Configuration menu - View commit details
-
Copy full SHA for b9a9c40 - Browse repository at this point
Copy the full SHA b9a9c40View commit details -
[TTS] whitelist broken path fix. (#6412)
* [TTS] whitelist broken path fix. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 14e9668 - Browse repository at this point
Copy the full SHA 14e9668View commit details
Commits on Apr 18, 2023
-
[TTS] FastPitch speaker encoder (#6417)
* Add initial codes Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Remove wemb Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Fix import Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore aligner loss Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Add ConditionalInput Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix error and support pre-trained config Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Follow comments Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename config Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change copyright and random weight test Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Add initial codes Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Fix import error Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Add initial codes Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Fix dataset error Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Remove reference speaker embedding Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Remove SV encoder Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Follow comments Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Fix length type Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Fix append Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Move error msg Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Add look-up into speaker encoder Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Add valueerror msg Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Move lookup Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Remove unused Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Fix error Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Rebase and Fix error Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Fix spk encoder Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Rename n_speakers Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * Follow comments Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix n_speakers None error Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> --------- Signed-off-by: hsiehjackson <c2hsieh@ucsd.edu> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 536ee62 - Browse repository at this point
Copy the full SHA 536ee62View commit details -
Sharded manifests for tarred datasets (#6395)
* testing sharded manifests Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * compatibility Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * proper fixes Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * adding flag tot convert_to_tarred_audio_dataset Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * shard_manifests conf param Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * propagating the shard_manifests param Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * propagating the shard_manifests param Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * distributed checks Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * typo Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * typo Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * fixes Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * fixes Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * fixes Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * fixes Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * fixes Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * fixes Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixes based on PR comments and tests Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixes to convert_to_tarred_audio_dataset.py Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * reversing manifest shards flag Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * tests Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * excluding manifests from webdataset url expansion Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * expand manifest paths before attempting to cache from datastore Signed-off-by: Dima Rekesh <bmwshop@gmail.com> * explicit use of UTF-8 for manifest i/o Signed-off-by: Dima Rekesh <bmwshop@gmail.com> --------- Signed-off-by: Dima Rekesh <bmwshop@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for ceb539f - Browse repository at this point
Copy the full SHA ceb539fView commit details -
Update wfst_text_normalization.rst (#6374)
Add Hungarian (incoming in NeMo-text-processing) Signed-off-by: Jim O’Regan <jaoregan@tcd.ie>
Configuration menu - View commit details
-
Copy full SHA for 499a3b2 - Browse repository at this point
Copy the full SHA 499a3b2View commit details
Commits on Apr 19, 2023
-
Support Swiglu in TP PP Conversion (#6437) (#6451)
* Support Swiglu in TP PP Conversion * Guard activation * Guard activation --------- Signed-off-by: smajumdar <titu1994@gmail.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for a365879 - Browse repository at this point
Copy the full SHA a365879View commit details -
Update NeMo_TTS_Primer.ipynb (#6436)
* Update NeMo_TTS_Primer.ipynb Changed a mistake in line 782. Instead of frequency band (ie. pitch) we should write frequency bin. Note that frequency bins in FFT are not related to pitch. Signed-off-by: Mostafa Ghorbandoost <mos.ghorbandoost@gmail.com> * Update NeMo_TTS_Primer.ipynb Corrected the description of spectrogram and mel spectrogram calculations in lines 782 & 783 and added a fourth point to the description and added a reference for more mathematical details at the end of this point. Signed-off-by: Mostafa Ghorbandoost <mos.ghorbandoost@gmail.com> --------- Signed-off-by: Mostafa Ghorbandoost <mos.ghorbandoost@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for be711c9 - Browse repository at this point
Copy the full SHA be711c9View commit details
Commits on Apr 20, 2023
-
add rampup batch size support for Megatron GPT (#6424)
* added rampup batch size support Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * added tests for rampup batch size Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * fixed the typos Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * added assertions Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * changed assertion rules Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * deleted unused imports Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * changed tests for rampup batch size Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * updated rampup batch size tests Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed styling Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> * rampup batch size tests changes Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> --------- Signed-off-by: Dmytro Pykhtar <dpykhtar@nvidia.com> Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com> Co-authored-by: Dmytro Pykhtar <dpykhtar@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 9e72326 - Browse repository at this point
Copy the full SHA 9e72326View commit details -
Meagtron encoder decoder fix for empty validation outputs (#6459) (#6461
Configuration menu - View commit details
-
Copy full SHA for 41fcf4d - Browse repository at this point
Copy the full SHA 41fcf4dView commit details
Commits on Apr 21, 2023
-
Code-Switching dataset creation - upgrading to aggregate tokenizer ma…
…nifest format (#6448) * added functionality to create agg tokenizer compatible manifest for CS, flag to use this mode by default Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com> * updated README with the new agg_tokenizer_manifest flag Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com> * fixed typo in scripts/speech_recognition/code_switching/README.md Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com> * changed agg_tokenizer_manifest to is_lid_manifest Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com> --------- Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com> Co-authored-by: Dima Rekesh <bmwshop@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 77f0959 - Browse repository at this point
Copy the full SHA 77f0959View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2822ff3 - Browse repository at this point
Copy the full SHA 2822ff3View commit details -
Update script for ngram rnnt and hat beam search decoding (#6370)
* add rnnt ngram beamsearch script Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * add return encoding embedding option Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * update script Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * add rnnt and hat ngram decoding script Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * add some parameters Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add return_encoder_embeddings parameter to RNNTDecodingConfig Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * replace return_encoder_embeddings parameter Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * generalization of scipt behavior Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * remove return_encoder_embeddings parameter Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * remove return_encoder_embeddings parameter Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * add manual encoder_embeddings calculation Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix beam_width value to 8 Signed-off-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com> * fix rescoring description Signed-off-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com> --------- Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> Signed-off-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 244ba8d - Browse repository at this point
Copy the full SHA 244ba8dView commit details
Commits on Apr 22, 2023
-
BERT pre-training mp fork to spawn (#6442) (#6454)
* change bert fork to spawn * num_workers=0 fix --------- Signed-off-by: Abhinav Khattar <aklife97@gmail.com> Co-authored-by: Abhinav Khattar <aklife97@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 094cbae - Browse repository at this point
Copy the full SHA 094cbaeView commit details -
fix replace_bos_with_pad not found (#6443) (#6450)
Signed-off-by: Abhinav Khattar <aklife97@gmail.com> Co-authored-by: Abhinav Khattar <aklife97@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for daa9744 - Browse repository at this point
Copy the full SHA daa9744View commit details -
reduce workers on NMT CI (#6472) (#6474)
Signed-off-by: Abhinav Khattar <aklife97@gmail.com> Co-authored-by: Abhinav Khattar <aklife97@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 557c4b7 - Browse repository at this point
Copy the full SHA 557c4b7View commit details
Commits on Apr 23, 2023
-
1. Added KERPLE positional embeddings to encoder-decoder.
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 690742b - Browse repository at this point
Copy the full SHA 690742bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8664d09 - Browse repository at this point
Copy the full SHA 8664d09View commit details -
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for ed4c373 - Browse repository at this point
Copy the full SHA ed4c373View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for e3ca438 - Browse repository at this point
Copy the full SHA e3ca438View commit details -
Configuration menu - View commit details
-
Copy full SHA for c6fa1a9 - Browse repository at this point
Copy the full SHA c6fa1a9View commit details -
Merge branch 'megatron-kerple-positional-embeddings' of github.com:NV…
…IDIA/NeMo into megatron-kerple-positional-embeddings
Configuration menu - View commit details
-
Copy full SHA for f482074 - Browse repository at this point
Copy the full SHA f482074View commit details -
Configuration menu - View commit details
-
Copy full SHA for f6ed850 - Browse repository at this point
Copy the full SHA f6ed850View commit details -
Configuration menu - View commit details
-
Copy full SHA for 27cf8de - Browse repository at this point
Copy the full SHA 27cf8deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0f593b8 - Browse repository at this point
Copy the full SHA 0f593b8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9e84e42 - Browse repository at this point
Copy the full SHA 9e84e42View commit details