A wav2vec2 model for recognizing Thai speech is available. However, I don’t know how to use it on GNU/Linux. Is there any speech recognition software for GNU/Linux that works with a wav2vec2 model?
[Longer version]
Thanks to Common Voice contributors, Mozilla, and @wannaphong@lemmy.ml, we now have a wav2vec2 model for recognizing Thai speech, trained on the Common Voice dataset. I can already use the model to convert my speech to text on the Hugging Face website. It is accurate, and I love it.
However, the speech-to-text demo on the Hugging Face website seems to be meant for testing. I want to use the model instead of typing in LibreOffice or Firefox. I did some exploring, but I didn’t find anything that I could use.
Is there any speech recognition software on GNU/Linux which will work with a wav2vec2 model?
There should be lots of PyTorch-based speech recognition models by now (they run in Python).
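For what it's worth, here is a rough sketch of how a PyTorch-based model could be wired into dictation from Python. It is untested against the Thai model and rests on several assumptions: the Hugging Face `transformers` package (with PyTorch) and `ffmpeg` are installed, audio is recorded with ALSA's `arecord`, and the text is typed into the focused window with `xdotool` (X11 only, and typing Thai characters may depend on your keyboard setup). The function names and `model_id` are placeholders I made up; put in the ID of the model you tested on the website.

```python
import subprocess
import tempfile
from pathlib import Path


def arecord_command(wav_path, seconds):
    """Build an ALSA `arecord` command: 16 kHz, mono, 16-bit PCM.

    wav2vec2 models are normally trained on 16 kHz audio, so we record
    at that rate directly.
    """
    return ["arecord", "-q", "-f", "S16_LE", "-r", "16000", "-c", "1",
            "-d", str(int(seconds)), str(wav_path)]


def xdotool_command(text):
    """Build an `xdotool` command that types `text` into the focused window."""
    return ["xdotool", "type", "--clearmodifiers", text]


def transcribe(wav_path, model_id):
    """Transcribe a WAV file with a wav2vec2 model from the Hugging Face Hub.

    Assumption: `transformers` and PyTorch are installed. The import is
    kept inside the function so the helpers above work without them.
    """
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # Passing a file path makes the pipeline decode the audio via ffmpeg.
    return asr(str(wav_path))["text"]


def dictate_once(model_id, seconds=5):
    """Record a few seconds of speech, transcribe it, and type the result."""
    with tempfile.TemporaryDirectory() as tmp:
        wav_path = Path(tmp) / "utterance.wav"
        subprocess.run(arecord_command(wav_path, seconds), check=True)
        text = transcribe(wav_path, model_id)
    if text.strip():
        subprocess.run(xdotool_command(text), check=True)
    return text
```

You could then bind something like `dictate_once("<model id from the Hub>")` to a keyboard shortcut, focus LibreOffice or Firefox, and speak. This reloads the model on every call, which is slow; a longer-running script that keeps the pipeline in memory would be the next step.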