Jane Austen Text Generation

About the project

The purpose of this project was to create and train a model capable of capturing and emulating the nuances of Jane Austen's writing. Several model architectures and tokenization techniques were explored and compared in terms of performance on a test set and quality of text generated. The data for training and testing is based on the works of Jane Austen as sourced from Project Gutenberg.

Models

Vanilla Recurrent Neural Network (RNN)
One-layer Long-Short-Term-Memory (LSTM)
Two-layer Long-Short-Term-Memory (LSTM)

Tokenization

Character-level
Word-level, using word2vec embedding
Subword-level, using Byte-Pair Encoding (BPE)

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
models.ipynb		models.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Jane Austen Text Generation

About the project

Models

Tokenization

About

Releases

Packages

Contributors 2

Languages

annamaartensson/dd2424project

Folders and files

Latest commit

History

Repository files navigation

Jane Austen Text Generation

About the project

Models

Tokenization

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages