Web application that converts audio and video to text using AI, supporting various formats and self-hosting.
Updated Apr 7, 2025 - Python
Takes an audio file (MP3) and a text string as input and force-aligns the text to the audio. Uses stable-ts and whisperx.