Kokoro-Based TTS Extension for Oobabooga Text Generation WebUI

Enhance your text generation experience with the Kokoro TTS extension, seamlessly integrating with the Oobabooga Text Generation WebUI.

License

Project License: This extension is released under the MIT License and is built upon the Original Kokoro 82M Inference Code.
Model Weights: The model weights are not covered by the MIT License. They are licensed under the Apache 2.0 License and will be directly downloaded from Hugging Face.

Features

Current Version: Kokoro v1 Supported Languages: English

Kokoro TTS is limited to inputs up to 510 tokens. Note that Kokoro tokens differ from LLM tokens. This extension allows you to generate longer audio outputs by splitting the input text into segments and concatenating the resulting audio.

Text Splitting Methods

Split by Sentence: Divides the text into chunks of complete sentences, each chunk containing fewer than or 510 tokens.
Split by Word: Divides the text into chunks of individual words, each chunk containing fewer than or 510 tokens.

I recommend using the "Split by Sentence" method to maintain context and ensure higher quality audio output.

Installation

Prerequisites

Before installing the extension, ensure you have the following dependencies installed:

eSpeak: Download from eSpeak NG Releases.
FFmpeg: Download from FFmpeg Downloads.

Python Dependencies

Install the required Python packages using the appropriate script for your operating system.

Windows

Run the Windows setup script:
```
.\cmd_windows.bat
```

Install the Python dependencies:

pip install -r extensions\KokoroTtsTextGenerationWebUI\requirements.txt

Linux

Run the Linux setup script:
```
./cmd_linux.sh
```

Install the Python dependencies:

pip install -r extensions/KokoroTtsTextGenerationWebUI/requirements.txt

Multiple GPU Support

By default, the extension utilizes the first available GPU. To specify a different GPU, modify the device variable in src/generate.py to your desired GPU identifier.

Roadmap

Contributing

I welcome contributions to improve this project! If you'd like to contribute, please create a pull request or open an issue. Your improvements and suggestions are highly appreciated.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
audio		audio
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
requirements.txt		requirements.txt
script.py		script.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kokoro-Based TTS Extension for Oobabooga Text Generation WebUI

License

Features

Text Splitting Methods

Installation

Prerequisites

Python Dependencies

Windows

Linux

Multiple GPU Support

Roadmap

Contributing

About

Contributors 2

Languages

License

h43lb1t0/KokoroTtsTexGernerationWebui

Folders and files

Latest commit

History

Repository files navigation

Kokoro-Based TTS Extension for Oobabooga Text Generation WebUI

License

Features

Text Splitting Methods

Installation

Prerequisites

Python Dependencies

Windows

Linux

Multiple GPU Support

Roadmap

Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 2

Languages