Whisper OSC

This is using the awesome whisper.cpp project to transcribe your microphone and send it as a message to VRChat. It is a quick adaptation of the stream example from whisper.cpp and uses tinyosc to build the OSC messages at the moment.

Still work in progress, but already kinda works on Linux.

Goals

Easy to use and setup on Windows
Easy to compile using zig as a build system

How to build and run

Download the appropriate models from whisper.cpp and copy them to your $PWD/models.
Install Zig and build it using zig build -Drelease-fast=true (Tested with zig 0.9.1)
Run it, e.g. using zig build run -Drelease-fast=true -- -m ./models/ggml-tiny.en.bin -t 10 --step 1100 --length 5000.

TODO

Get cross compiling to windows to work
More post processing of chat output, since VRChat throttles chatbox messages
- Filter out reptition, non-voice tokens and simply throttle the output somehow
- Transcriptions can also dissapear too quickly.

Port it to zig and clean it up
Consider moving from SDL to miniaudio or something else
GUI?

Prior Art

Whispering Tiger looks really cool, but I haven't tried it yet. I hope this project can be a bit more lightweight though, but probably won't be as accurate and fast since it's CPU only and has some other limitations.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
cpp		cpp
deps		deps
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.zig		build.zig

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisper OSC

Goals

How to build and run

TODO

Prior Art

About

Releases

Packages

Languages

License

Okabintaro/whisper_osc

Folders and files

Latest commit

History

Repository files navigation

Whisper OSC

Goals

How to build and run

TODO

Prior Art

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages