Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

集成 UVR 以通过去除背景噪音来提高 Whisper 的准确性 / Integrate UVR for Enhanced Whisper Accuracy by Removing Background Noise #301

Open
AlexMa2011 opened this issue Aug 12, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@AlexMa2011
Copy link

AlexMa2011 commented Aug 12, 2024

Is your feature request related to a problem? Please describe.

To improve the accuracy of Whisper in noisy audio environments, it’s crucial to provide it with cleaner audio inputs. The current voice detection models (VAD) are not sufficient for separating vocals effectively in complex audio scenarios.

为了提高在嘈杂音频环境中 Whisper 的准确性,提供更清晰的音频输入至关重要。当前的人声检测模型(VAD)在复杂音频场景中分离人声的效果不够理想。

Describe the solution you’d like

I suggest integrating Ultimate Vocal Remover (UVR), which effectively isolates vocals from mixed audios. This can reduce the interference from background music and ambient noise on Whisper’s voice recognition.

我建议集成 Ultimate Vocal Remover(UVR),该技术能有效从混合音频中隔离人声。这样可以减少背景音乐和环境噪声对 Whisper 语音识别的干扰。

Describe alternatives you’ve considered

While manual preprocessing using separate tools can achieve similar results, it increases the workflow complexity and user effort. An integrated solution within MemoAI would greatly enhance user experience and efficiency.

虽然使用单独工具进行手动预处理可以达到类似的效果,但它增加了工作流程的复杂性和用户的劳动量。在 MemoAI 中集成此解决方案将大大提高用户体验和效率。

Additional context

URLs : UVR
Python package

@AlexMa2011 AlexMa2011 added the enhancement New feature or request label Aug 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant