Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Updated Mar 26, 2025 - Python
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and natural interruption handling.
A local and uncensored AI entity.
Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
Yet another self-hosted AI voice assistant. GLaDOS's blazing-fast pipeline with a more realistic Kokoro TTS voice and vision.
An extension to use Kokoro TTS in text generation webui
Kokoro Manim Voiceover
Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It uses a local Perplexica instance with function calling to retrieve and summarise search results in natural language. Powered by Faster Whisper, Ollama (Qwen 2.5) and Kokoro-82M TTS.
A text-to-speech plugin for [Bob](https://bobtranslate.com/) that uses a locally deployed Kokoro model as its speech synthesis service.
Listen to DeepSeek's thinking process in real-time! This script converts DeepSeek's thinking tags (<think>...</think>) to speech using Kokoro TTS, allowing you to hear the model's "thoughts" as it reasons through your questions.
Leverage Kokoro's TTS capabilities using LitServe.
Mirror - A simple script to convert an analog book (image) into an audiobook (via AI)
A speech-to-speech chat app using Gemini and Kokoro