Any speech to text library that uses whisper api?

k110111@feddit.de · 8 months ago

Any speech to text library that uses whisper api?

Sims@lemmy.ml · 8 months ago

Dunno, but this guy (all about ai) builds one with ‘faster-whisper’, so perhaps you can get a few pointers there? I believe he chunks the Audio on silence. He have a few other speech2x videos. Have fun. https://youtu.be/k6nIxWGdrS4

Also: https://github.com/SYSTRAN/faster-whisper

Sims@lemmy.ml · 8 months ago

Just stumbled upon this speedy one: https://github.com/sanchit-gandhi/whisper-jax

And this one for word precision time marks: https://github.com/m-bain/whisperX

PipedLinkBot@feddit.rocks · 8 months ago

Here is an alternative Piped link(s):

https://piped.video/k6nIxWGdrS4

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I’m open-source; check me out at GitHub.