NVIDIA · ekmb · Oct 4, 2022 · Oct 4, 2022
diff --git a/docs/source/nlp/models.rst b/docs/source/nlp/models.rst
@@ -8,7 +8,7 @@ NeMo's NLP collection supports provides the following task-specific models:
 .. toctree::
    :maxdepth: 1
 
-   punctuation_and_capitalization
+   punctuation_and_capitalization_models
    token_classification
    joint_intent_slot
    text_classification

diff --git a/docs/source/nlp/nlp_all.bib b/docs/source/nlp/nlp_all.bib
@@ -170,4 +170,11 @@ @inproceedings{koehnetal2007moses
     publisher = "Association for Computational Linguistics",
     url = "https://aclanthology.org/P07-2045",
     pages = "177--180",
+}
+
+@article{sunkara2020multimodal,
+  title={Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech},
+  author={Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff},
+  journal={arXiv preprint arXiv:2008.00702},
+  year={2020}
 }
diff --git a/docs/source/nlp/punctuation_and_capitalization.rst b/docs/source/nlp/punctuation_and_capitalization.rst
@@ -3,14 +3,6 @@
 Punctuation and Capitalization Model
 ====================================
 
-Automatic Speech Recognition (ASR) systems typically generate text with no punctuation and capitalization of the words.
-There are two issues with non-punctuated ASR output:
-
-- it could be difficult to read and understand
-- models for some downstream tasks, such as named entity recognition, machine translation, or text-to-speech, are
-  usually trained on punctuated datasets and using raw ASR output as the input to these models could deteriorate their
-  performance
-
 Quick Start Guide
 -----------------