* Tn bug 1.7.0 (#3730)
* fix es and fr bug Signed-off-by: Yang Zhang <yangzhang@nvidia.com>
* add file Signed-off-by: Yang Zhang <yangzhang@nvidia.com>
* [TTS] Fix bugs in E2E TTS, Mixer-TTS and FastPitch (#3740)
* fix bugs Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com>
* fix bug in e2e tts and mixer tts Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com>
* Mirror AN4 data while servers are down (#3743) Signed-off-by: smajumdar <titu1994@gmail.com>
* Bugfix for GPT eval (#3744)
* use tokens_cut not tokens Signed-off-by: ericharper <complex451@gmail.com>
* remove precision conversion and comment jit for bias gelu Signed-off-by: ericharper <complex451@gmail.com>
* revert comment update mbs in config Signed-off-by: ericharper <complex451@gmail.com>
* calculate micro_batch_size during complete and compute_logprobs Signed-off-by: ericharper <complex451@gmail.com>
* ASR SSL update (#3746)
* ssl update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* tutorial update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* Fix SSL configs for 1.7 (#3748)
* ssl update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* tutorial update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* revert configs Signed-off-by: sam1373 <samuelkriman@gmail.com>
* revert configs Signed-off-by: sam1373 <samuelkriman@gmail.com>
* punct process bug fix (#3747) Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
* updated conformer models. (#3741) Signed-off-by: Vahid <vnoroozi@nvidia.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
* Yuya/megatron t5 glue eval (#3751)
* Add megatron t5 glue eval-only script Signed-off-by: Yu Yao <yuya@nvidia.com>
* Update megatron t5 glue eval default configs Signed-off-by: Yu Yao <yuya@nvidia.com>
* Update megatron t5 glue eval configs Signed-off-by: Yu Yao <yuya@nvidia.com>
* Update config comments Signed-off-by: Yu Yao <yuya@nvidia.com> Co-authored-by: Yu Yao <yuya@nvidia.com>
* Specify gpus in SSL notebook (#3753)
* ssl update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* tutorial update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* revert configs Signed-off-by: sam1373 <samuelkriman@gmail.com>
* revert configs Signed-off-by: sam1373 <samuelkriman@gmail.com>
* specify gpus Signed-off-by: sam1373 <samuelkriman@gmail.com>
* Duplex model inference fix, money encoder fix (#3754) Signed-off-by: ekmb <ebakhturina@nvidia.com>
* Update docs for RNNT and overriding fused batch size (#3755) Signed-off-by: smajumdar <titu1994@gmail.com>
* fix consumed samples calculation + PTune Model bugs (#3738)
* fix the way computing consumed samples Signed-off-by: Yi Dong <yidong@nvidia.com>
* fixed ptune model Signed-off-by: Yi Dong <yidong@nvidia.com>
* make sure notebook is working Signed-off-by: Yi Dong <yidong@nvidia.com>
* added try-catch Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com>
* fix directories in ssl notebook (#3758)
* ssl update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* tutorial update Signed-off-by: sam1373 <samuelkriman@gmail.com>
* revert configs Signed-off-by: sam1373 <samuelkriman@gmail.com>
* revert configs Signed-off-by: sam1373 <samuelkriman@gmail.com>
* specify gpus Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update dirs Signed-off-by: sam1373 <samuelkriman@gmail.com>
* TN docs update (#3735)
* TN docs update: audio based docs added, quick start, ref fixed, etc Signed-off-by: ekmb <ebakhturina@nvidia.com>
* add deployment script dir and Sp TN Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
* Update Tacotron2_Training.ipynb (#3769) Signed-off-by: Jason <jasoli@nvidia.com>
* update branch Signed-off-by: ericharper <complex451@gmail.com>
* update requirements and package info Signed-off-by: ericharper <complex451@gmail.com>
* revert Signed-off-by: ericharper <complex451@gmail.com>
* revert Signed-off-by: ericharper <complex451@gmail.com>
* update branch Signed-off-by: ericharper <complex451@gmail.com>
* remove unused import Signed-off-by: ericharper <complex451@gmail.com>

Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com>
Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com>
Co-authored-by: Yu Yao <yuya@nvidia.com>
Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com>
Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>
Co-authored-by: Jason <jasoli@nvidia.com>
Showing 63 changed files with 819 additions and 317 deletions.
@@ -0,0 +1,21 @@
(Inverse) Text Normalization
============================

NeMo supports Text Normalization (TN) and Inverse Text Normalization (ITN) tasks via the rule-based `nemo_text_processing` Python package and a neural TN/ITN model.

Rule-based (WFST) TN/ITN:

.. toctree::
   :maxdepth: 1

   wfst/intro


Neural TN/ITN:

.. toctree::
   :maxdepth: 1

   nn_text_normalization

@@ -0,0 +1,56 @@
@article{ebden2015kestrel,
  title={The Kestrel TTS text normalization system},
  author={Ebden, Peter and Sproat, Richard},
  journal={Natural Language Engineering},
  volume={21},
  number={3},
  pages={333},
  year={2015},
  publisher={Cambridge University Press}
}

@article{sproat2016rnn,
  title={RNN approaches to text normalization: A challenge},
  author={Sproat, Richard and Jaitly, Navdeep},
  journal={arXiv preprint arXiv:1611.00068},
  year={2016}
}

@book{taylor2009text,
  title={Text-to-speech synthesis},
  author={Taylor, Paul},
  year={2009},
  publisher={Cambridge University Press}
}

@misc{zhang2021nemo,
  title={NeMo Inverse Text Normalization: From Development To Production},
  author={Yang Zhang and Evelina Bakhturina and Kyle Gorman and Boris Ginsburg},
  year={2021},
  eprint={2104.05055},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}

@inproceedings{sparrowhawk,
  title = {TTS for Low Resource Languages: A Bangla Synthesizer},
  author = {Alexander Gutkin and Linne Ha and Martin Jansche and Knot Pipatsrisawat and Richard Sproat},
  booktitle = {10th Language Resources and Evaluation Conference},
  year = {2016},
}

@article{mohri2005weighted,
  title={Weighted automata in text and speech processing},
  author={Mohri, Mehryar and Pereira, Fernando and Riley, Michael},
  journal={arXiv preprint cs/0503077},
  year={2005}
}

@incollection{mohri2009weighted,
  title={Weighted automata algorithms},
  author={Mohri, Mehryar},
  booktitle={Handbook of weighted automata},
  pages={213--254},
  year={2009},
  publisher={Springer}
}
@@ -0,0 +1,22 @@
WFST-based (Inverse) Text Normalization
=======================================

NeMo supports Text Normalization (TN) and Inverse Text Normalization (ITN) tasks via the rule-based `nemo_text_processing` Python package and a neural TN/ITN model.

`nemo_text_processing` is installed with `nemo_toolkit`; see :doc:`NeMo Introduction <../starthere/intro>` for installation details.
Additional requirements can be found in `setup.sh <https://github.com/NVIDIA/NeMo/blob/stable/nemo_text_processing/setup.sh>`_.

Tutorials on how to get started with WFST-based NeMo text normalization can be found in `tutorials/text_processing <https://github.com/NVIDIA/NeMo/tree/stable/tutorials/text_processing>`_.

Rule-based (WFST) TN/ITN:

.. toctree::
   :maxdepth: 2

   wfst_text_normalization
   wfst_inverse_text_normalization
   wfst_text_processing_deployment
   wfst_api

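The two directions named in the hunk above, TN (written form to spoken form) and ITN (spoken form back to written form), can be illustrated with a minimal pure-Python sketch. This is not the `nemo_text_processing` API and it does not use WFSTs: the lookup table `WRITTEN_TO_SPOKEN` and the helpers `normalize` and `inverse_normalize` are hypothetical names invented for this illustration, and a plain dictionary stands in for real grammars.

```python
# Toy illustration only: a lookup table standing in for real WFST grammars.
# All names here are hypothetical and are not part of nemo_text_processing.
WRITTEN_TO_SPOKEN = {
    "$5": "five dollars",
    "3kg": "three kilograms",
    "Dr.": "doctor",
}

# Invert the table for the ITN direction (spoken form -> written form).
SPOKEN_TO_WRITTEN = {spoken: written for written, spoken in WRITTEN_TO_SPOKEN.items()}


def normalize(text: str) -> str:
    """TN: replace each known whitespace-separated written token with its spoken form."""
    return " ".join(WRITTEN_TO_SPOKEN.get(tok, tok) for tok in text.split())


def inverse_normalize(text: str) -> str:
    """ITN: replace each known spoken phrase with its written form.

    Matching is plain substring replacement, so it ignores word boundaries.
    """
    # Replace longer phrases first so a short phrase cannot clobber a longer one.
    for spoken in sorted(SPOKEN_TO_WRITTEN, key=len, reverse=True):
        text = text.replace(spoken, SPOKEN_TO_WRITTEN[spoken])
    return text


if __name__ == "__main__":
    spoken = normalize("Dr. Smith paid $5")
    print(spoken)
    print(inverse_normalize(spoken))
```

A lookup table like this cannot handle context or open-ended classes such as arbitrary numbers and dates, which is the gap that grammar-based and neural approaches are meant to cover.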
2 changes: 2 additions & 0 deletions
docs/source/nemo_text_processing/api.rst → .../nlp/text_normalization/wfst/wfst_api.rst
@@ -1,3 +1,5 @@
.. _wfst_api:

NeMo Text Processing API
========================
