Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

P&C docs #5069

Merged
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion docs/source/nlp/models.rst
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ NeMo's NLP collection supports provides the following task-specific models:
.. toctree::
:maxdepth: 1

punctuation_and_capitalization
punctuation_and_capitalization_models
token_classification
joint_intent_slot
text_classification
Expand Down
7 changes: 7 additions & 0 deletions docs/source/nlp/nlp_all.bib
Original file line number Diff line number Diff line change
Expand Up @@ -170,4 +170,11 @@ @inproceedings{koehnetal2007moses
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/P07-2045",
pages = "177--180",
}

@article{sunkara2020multimodal,
title={Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech},
author={Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff},
journal={arXiv preprint arXiv:2008.00702},
year={2020}
}
8 changes: 0 additions & 8 deletions docs/source/nlp/punctuation_and_capitalization.rst
Original file line number Diff line number Diff line change
Expand Up @@ -3,14 +3,6 @@
Punctuation and Capitalization Model
====================================

Automatic Speech Recognition (ASR) systems typically generate text with no punctuation and capitalization of the words.
There are two issues with non-punctuated ASR output:

- it could be difficult to read and understand
- models for some downstream tasks, such as named entity recognition, machine translation, or text-to-speech, are
usually trained on punctuated datasets and using raw ASR output as the input to these models could deteriorate their
performance

Quick Start Guide
-----------------

Expand Down
Loading