-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
silence removal for transcription implemented #1649
base: master
Are you sure you want to change the base?
Conversation
@artshcherbina Hello, I think, this is good, but I wanted to asked, if this can be implemented also to be in the .srt file and so on? And if yes, what happens, does the timestamps correctly align with the original file? I implemented something similar in Windows via ffmpeg and silence detection.
However, this did not improve anything unfortunately. The sad thing is, that Whisper is trying to transcribe literally everything. Do you any audio samples, I could try to test out with my script, I have a few scripts, which I would like to test out and share with the community, if they are working good or better. |
Hello, @jensdraht1999 . |
I wonder, if you could resolve the rest of the issues and then merge it. |
My changes break some default behavior. |
Thank you for you great work!
I've added some simple logic to detect silence, and process only real voice input.
I start transcription, if more then 1 second of silence is passed (You may need to tune
--pressure-t
to your microphone).You may tune silence duration with
--silence_t
argument.Transcribed text is copied to the clipboard (currenly only on linux, xclip is used).
The code can be further improved, if it seems reasonable.