
Build n-gram language model for DeepSpeech2, and add inference interfaces insertable to CTC decoder. #2229

Closed
xinghai-sun opened this issue May 22, 2017 · 3 comments
xinghai-sun commented May 22, 2017

  • Train an English language model (Kneser-Ney smoothed 5-gram, with pruning) with the KenLM toolkit, on cleaned text from the Common Crawl Repository. For detailed requirements, please refer to the DS2 paper.
  • Add the training script into the DS2 trainer script.
  • Add inference interfaces for this n-gram language model, insertable to CTC-LM-beam-search for decoding.
  • Keep in mind that the interfaces should be compatible with both English (word-based LM) and Mandarin (character-based LM).
  • Please work closely with the "Add CTC-LM-beam-search decoder" task.
  • Refer to the DS2 design doc and update it when necessary.
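The interface requirement above (a scorer that plugs into the CTC beam-search decoder and works for both word-based and character-based LMs) might be sketched like this. Note this is an illustrative design, not code from the DS2 repo; the names `LMScorer` and `UniformScorer` are hypothetical:

```python
import math
from abc import ABC, abstractmethod


class LMScorer(ABC):
    """Hypothetical interface for plugging an n-gram LM into CTC beam search.

    The scorer operates on a prefix of tokens, so a word-based English LM
    and a character-based Mandarin LM can share the same decoder code: the
    decoder only decides how to tokenize (split on spaces vs. characters).
    """

    @abstractmethod
    def score(self, prefix):
        """Log-probability of the last token in `prefix` given its history."""


class UniformScorer(LMScorer):
    """Trivial stand-in: every token gets the same log-probability.

    Useful as a no-op baseline when wiring up the beam-search decoder
    before the real KenLM-backed scorer is ready.
    """

    def __init__(self, vocab_size):
        self.logp = -math.log(vocab_size)

    def score(self, prefix):
        return self.logp
```

A real implementation would wrap a loaded KenLM model behind the same `score` method, so the decoder never needs to know which LM is behind it.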

pkuyym commented May 31, 2017

@cxwangyi @kuke @xinghai-sun
Hi, as mentioned in the paper, a language model has to be trained to improve the generating results and the LM is a critical component to ensure the performance. The language model is trained on texts crawled from commoncrawl.org using KenLM toolkit. However, we need more details to train such a language model. Any possible to get the trained language model or text dataset trained on?

@xinghai-sun
Contributor Author

  1. There should be plenty of English corpora available; we need not restrict ourselves to the corpus mentioned in the paper. We can start experimenting with a small corpus, e.g. PTB.
  2. For n-gram LM training, prefer the KenLM toolkit. If another tool is used, make sure the smoothing method is aligned with KenLM's, or is otherwise reasonable.
  3. Focus on the design of the model-loading and inference interfaces, and coordinate integration testing with the beam search decoder.
  4. Ask the NLP or SVAIL teams whether a powerful off-the-shelf LM model already exists, for both English and Chinese. Please ask @lcy-seso to assist.
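The beam-search integration mentioned in point 3 amounts to shallow fusion: rescoring each candidate transcription with a weighted sum of the acoustic score, the LM score, and a length bonus, as in the DS2 decoding objective Q(y) = log P_ctc(y|x) + alpha * log P_lm(y) + beta * word_count(y). A minimal sketch (alpha/beta values here are placeholders; DS2 tunes them on a dev set):

```python
def fuse_scores(candidates, lm_logprob, alpha=0.5, beta=0.3):
    """Pick the best candidate under the shallow-fusion objective.

    candidates: list of (token_sequence, acoustic_log_prob) pairs
                produced by the CTC beam search.
    lm_logprob: callable returning the total LM log-probability of a
                token sequence (e.g. backed by a KenLM model).
    """
    scored = [
        (tokens, am_logp + alpha * lm_logprob(tokens) + beta * len(tokens))
        for tokens, am_logp in candidates
    ]
    return max(scored, key=lambda t: t[1])
```

With a good LM, a candidate with a slightly worse acoustic score but a much better LM score (e.g. "hello world" vs. "helo world") wins the rescoring, which is exactly the effect the task above is after.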

heavengate pushed a commit to heavengate/Paddle that referenced this issue Aug 16, 2021
* update faster modelzoo and config, test=dygraph

* update model link, test=dygraph
wwfcnu commented Apr 30, 2024

  1. There should be plenty of English corpora available; we need not restrict ourselves to the corpus mentioned in the paper. We can start experimenting with a small corpus, e.g. PTB.
  2. For n-gram LM training, prefer the KenLM toolkit. If another tool is used, make sure the smoothing method is aligned with KenLM's, or is otherwise reasonable.
  3. Focus on the design of the model-loading and inference interfaces, and coordinate integration testing with the beam search decoder.
  4. Ask the NLP or SVAIL teams whether a powerful off-the-shelf LM model already exists, for both English and Chinese. Please ask @lcy-seso to assist.

@lcy-seso Is there a ready-to-use LM model available?
