Wav2vec2-compatible Speech recognition software

vi21@lemmy.ml · 2 years ago

[Longer version]

Thanks to the Common Voice contributors, Mozilla, and @wannaphong@lemmy.ml, we now have a Wav2vec2 model for recognizing Thai speech, trained on the Common Voice dataset. I can now use the model to convert my speech to text on the Hugging Face website. It works accurately. I love it.

However, speech-to-text on the Hugging Face website seems to be meant only for testing. I want to use it in place of typing in LibreOffice or Firefox. I did some exploring, but I didn't find anything I could use.

Is there any speech recognition software for GNU/Linux that works with a wav2vec2 model?

thervingi@lemmy.ml · 2 years ago

There should be plenty of PyTorch-based speech recognition models by now, and they can be run from Python.
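
As a rough illustration of what running such a model from Python could look like, here is a minimal sketch, not a finished dictation tool. It assumes the `transformers`, `torch`, and `soundfile` packages are installed, that the recording is a 16 kHz mono WAV file, and that `xdotool` is available on an X11 session for typing into the focused window. `MODEL_ID` is a hypothetical placeholder, and the helper names `transcribe_wav`, `xdotool_type_command`, and `dictate` are made up for this example.

```python
import subprocess

# Hypothetical placeholder: replace with the Hugging Face id of the Thai model.
MODEL_ID = "your-namespace/your-thai-wav2vec2-model"


def transcribe_wav(path, model_id=MODEL_ID):
    """Return the text recognized in a 16 kHz mono WAV file."""
    # Heavy imports stay local so the module loads even without them installed.
    import soundfile as sf
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)

    speech, sample_rate = sf.read(path)
    if sample_rate != 16000:
        # Assumption: the model expects 16 kHz input, as wav2vec2 models usually do.
        raise ValueError("expected 16 kHz audio, got %d Hz" % sample_rate)

    inputs = processor(speech, sampling_rate=sample_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy CTC decoding: pick the most likely token for every audio frame.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]


def xdotool_type_command(text):
    """Build the xdotool command that types `text` into the focused window."""
    return ["xdotool", "type", "--clearmodifiers", text]


def dictate(path):
    """Transcribe a recording and type the result where the cursor is."""
    text = transcribe_wav(path)
    subprocess.run(xdotool_type_command(text), check=True)
```

Calling `dictate("recording.wav")` with LibreOffice or Firefox focused should then type the recognized text there, under the assumptions above. Recording from the microphone and running on Wayland are not covered by this sketch.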
