If the audio input is completely silent, the model still returns text such as "MBC 뉴스 이재경입니다." ("This is Lee Jae-kyung, MBC News.").
It seems VAD has been applied in SYSTRAN/faster-whisper, since a sample with a long silence followed by speech at the end is transcribed correctly.
This is inherent to all Whisper-based models.
Also related to #108: cleaning up the audio will return a very short result if it's silence, so it would be ignored as being under the duration threshold.
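As a rough illustration of the duration-threshold idea above, here is a minimal sketch of an energy-based pre-check that skips transcription when too little non-silent audio remains. This is not this project's code or the approach in #108; the function names (`frame_rms`, `voiced_duration`, `should_transcribe`) and the default thresholds are hypothetical and would need tuning for real recordings.

```python
import math

def frame_rms(frame):
    """Root-mean-square amplitude of one frame of float samples in [-1, 1]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))

def voiced_duration(samples, sample_rate, frame_ms=30, rms_threshold=0.01):
    """Seconds of audio whose per-frame RMS exceeds the (assumed) threshold."""
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    voiced = 0
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        if frame_rms(frame) > rms_threshold:
            voiced += len(frame)
    return voiced / sample_rate

def should_transcribe(samples, sample_rate, min_voiced_s=0.3):
    """Skip the model entirely when less than min_voiced_s of audio is non-silent."""
    return voiced_duration(samples, sample_rate) >= min_voiced_s
```

With this kind of gate, a fully silent clip never reaches the model, so it cannot produce a hallucinated sentence. A plain energy threshold is much cruder than a real VAD and will pass loud non-speech noise, so treat it only as a starting point.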