real time transcription - gradio audiostreaming feature ready #82

sandorkonya · 2024-09-13T19:35:29Z

@fedirz thank you for this repo, so far the best one i found for live transcription, only one of the few that utilizes localagreement!

The gradient demo on Huggingface features recording audio, but an upcoming gradio merge is going to allow real time transcription directly from browser!
could you add flozi00/distilwhisper-german-canary to the available models?

regards

fedirz · 2024-09-21T01:17:54Z

Thanks, I'll take a look once 5.0 is merged in
This project supports only models which can be run with ctranslate2. flozi00/distilwhisper-german-canary isn't compatible with ctranslate2 as far as I can tell. The author does have a model flozi00/whisper-large-v3-german-ct2 which looks like it's ctranslate2 compatible, however the model doesn't have proper tags and that's why you don't see it in the list of models when running curl -s http://localhost:8000/v1/models | jq '.data[].id'. Consider asking the author of the model to add a ctranslate2 tag to his model.
For reference, here's a piece of code which determines available models.

models = huggingface_hub.list_models(library="ctranslate2", tags="automatic-speech-recognition", cardData=True)

Provide feedback