GitHub - batvoice-org/tf2-punctuator2: Tensorflow 2.0 implementation of automatic punctuation with RNN + attention

Tensorflow 2.0 implementation of RNN + attention-based automatic punctuation

Derived from this project written in Theano.

This is currently a rough draft, tested with only a single type of "punctuation" (in practice, sentence boundaries). It should, however, adapt readily to any number of punctuation markers.
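To illustrate what moving from sentence boundaries to several punctuation markers involves, here is a minimal sketch that treats punctuation restoration as per-word labeling. It is not taken from this repository: the marker set, the label names, and the `to_labeled_words` helper are all hypothetical, and the actual format expected by the scripts is the one shown in example_data.

```python
# Hypothetical marker set; the repository's own label scheme may differ.
MARKERS = {".": "PERIOD", ",": "COMMA", "?": "QUESTION"}
NO_PUNCT = "O"  # label for a word that is not followed by any marker


def to_labeled_words(text, markers=MARKERS):
    """Turn punctuated text into (word, label) pairs.

    Each word is labeled with the marker that directly follows it,
    or NO_PUNCT when none does.
    """
    pairs = []
    for token in text.split():
        label = NO_PUNCT
        # Peel a single trailing marker off the token, if present.
        if len(token) > 1 and token[-1] in markers:
            label = markers[token[-1]]
            token = token[:-1]
        pairs.append((token.lower(), label))
    return pairs
```

Under this framing, restricting `markers` to a single entry gives the sentence-boundary case, and adding entries only enlarges the set of labels the model has to predict.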

A small script is also included to run inference and visualize the attention weights for any sentence fed to a trained model.
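As a rough idea of what visualizing attention weights can look like, the sketch below normalizes a list of raw scores with a softmax and prints one text bar per word. It is only an illustration under stated assumptions: the `softmax` and `render_attention` helpers are hypothetical, the scores would in practice come from the trained model, and the repository's script may render the weights quite differently.

```python
import math


def softmax(scores):
    """Normalize raw scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def render_attention(words, weights, width=20):
    """Return one text line per word with a bar proportional to its weight."""
    longest = max(len(w) for w in words)
    lines = []
    for word, weight in zip(words, weights):
        bar = "#" * round(weight * width)
        lines.append(f"{word.ljust(longest)} {weight:.2f} {bar}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up scores for illustration only, not model output.
    words = ["see", "you", "tomorrow", "thanks"]
    print(render_attention(words, softmax([0.1, 0.3, 2.0, 0.5])))
```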

Hyperparameters and paths to data and checkpoints are set in a bash file that must be sourced before running a script, e.g.

source env.sh
python 01_train.py
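One way a Python script can pick up settings from a sourced bash file is through environment variables. The sketch below shows that pattern only; the variable names (`TRAIN_DATA`, `CHECKPOINT_DIR`, `BATCH_SIZE`, `LEARNING_RATE`), the default values, and the `read_config` helper are hypothetical and are not taken from env.sh.example or from the training script.

```python
import os


def read_config(environ=None):
    """Collect settings from environment variables, with fallbacks.

    All variable names and default values here are hypothetical.
    """
    env = os.environ if environ is None else environ
    return {
        "train_data": env.get("TRAIN_DATA", "path/to/train.txt"),
        "checkpoint_dir": env.get("CHECKPOINT_DIR", "path/to/checkpoints"),
        # Environment variables are always strings, so cast explicitly.
        "batch_size": int(env.get("BATCH_SIZE", "32")),
        "learning_rate": float(env.get("LEARNING_RATE", "0.001")),
    }


if __name__ == "__main__":
    print(read_config())
```

Note that a variable assigned in a sourced file is visible to a child Python process only if it is exported (for example `export BATCH_SIZE=64`); a plain assignment stays local to the shell.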

See example_data to see how the data should be formatted.

Preprocessing scripts are not provided in this version.

Files in the repository:

example_data
01_train.py
02_test.py
03_infer_and_visualize.py
README.md
checkpoints.py
env.sh.example
metrics.py
model.py
optimization.py
pipeline.py
requirements.txt
