$ export HOST="http://localhost:3000"
$ curl "$HOST/v1/audio/speech" \
--verbose \
-H "Content-Type: application/json" \
-d '{ "model": "tts-1", "input": "Today is a wonderful day to build something people love!", "voice": "am_adam" }' \
--output speech.mp3
VLC is unable to play it and ffmpeg also crashes when trying to convert it (while it worked with OpenAI's mp3). The vscode audio-preview extension can play it without problems.
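Since different players disagree about `speech.mp3`, one way to narrow this down is to look at the file's leading bytes instead of trusting the extension. The sketch below is only a heuristic: `detect_audio_format` is a hypothetical helper (not part of Kokoros or any tool mentioned here), and it recognizes just the RIFF/WAVE header, an ID3 tag, and the MPEG Layer III frame sync.

```shell
# Hypothetical helper: guess the audio container from a file's first bytes.
detect_audio_format() {
  # Read the first 12 bytes as one lowercase hex string (no spaces or newlines).
  header=$(head -c 12 "$1" | od -An -tx1 | tr -d ' \n')
  case "$header" in
    # "RIFF", 4 size bytes, then "WAVE"
    52494646????????57415645) echo "wav" ;;
    # "ID3" tag that commonly precedes MP3 frames
    494433*) echo "mp3" ;;
    # MPEG-1, MPEG-2 and MPEG-2.5 Layer III frame sync
    fffa*|fffb*|fff2*|fff3*|ffe2*|ffe3*) echo "mp3" ;;
    *) echo "unknown" ;;
  esac
}
```

Running `detect_audio_format speech.mp3` on the downloaded file would print `wav` if the response is really a WAV file saved under an `.mp3` name, which might explain why some tools reject it.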
Hello, the output format should be selectable per request. WAV is the default format used when saving. Would you like to open a PR that adds MP3 output, matching what OpenAI returns?
curl -X POST http://localhost:3000/v1/audio/speech \
-H "Content-Type: application/json" \
-d '{ "model": "anything can go here", "input": "Hello, this is a test of the Kokoro TTS system!", "voice": "af_sky", "response_format": "mp3" }' --output sky-says-hello.mp3
I normally convert the MP3 to an M4A file so that it is ready to paste or share in chat systems, where people can easily listen to the voice message.
From the OpenAI docs, OpenAI returns an mp3:
But if I look at the file that I get from the Kokoros OpenAI compatible endpoint, it looks like it's a wav file?
This is the request I made