-
Would you happen to have any video with the expected output to test?
-
When I first saw this project, I was a little inspired by how much support it has for a vast diversity of media: https://github.com/rmusser01/tldw
Not just on the document-reading or web-crawling side, but on the multimedia-handling side of things as well (#138).
Podcasts, live streams, presentations, video essays, theater transcriptions, documentaries... Most of these use something like OpenAI's Whisper model for voice and maybe video VLMs for visuals, but even then there are options.
Major hurdles:
Examples: