Feature Request: Integration w/ Conversation Framework Pipecat #810
Rodaviva29
started this conversation in
Ideas
Replies: 1 comment 1 reply
-
https://github.com/SWivid/F5-TTS/tree/main/src/f5_tts/infer#socket-realtime-client from #803 There may be some first packet delay as it's a NAR TTS, but there will be no delay if the LLM part prefill in advance some text content when the user is about to finish speaking (in that case, may even have larger chunk size for better consistency). |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Integrate F5-TTS to a well-known framework for voice and multimodal conversational AI, pipecat: https://github.com/pipecat-ai/pipecat.
Currently it supports a lot of Text-to-Speech services (https://docs.pipecat.ai/server/services/supported-services), but F5-TTS has gain a lot of popularity and would be amazing to have it integrated with another framework.
Integration in services, maybe similar to XTTS as a streaming alike? https://github.com/pipecat-ai/pipecat/blob/main/src/pipecat/services/xtts.py.
I would to get this into reality but I'm not very into this type of programming, so it would be amazing if anyone could contribute ❤️
Beta Was this translation helpful? Give feedback.
All reactions