Wav2vec2 model for recognizing Thai speech is available. However, I don’t know how to use it on GNU/Linux. Is there any speech recognition software on GNU/Linux which will work with a wav2vec2 model?

  • vi21@lemmy.mlOP
    link
    fedilink
    arrow-up
    2
    ·
    2 years ago

    [Longer version]

    Thanks to Common Voice contributors, Mozilla and @wannaphong@lemmy.ml , now we have a Wav2vec2 model for recognizing Thai speech available by training a wav2vec2 model on the Common Voice dataset. Now, I can use the model to convert my speech to text on the Huggingface website. It works accurately. I love it.

    However, using speech-to-text on the Huggingface website seems to be for testing. I want to use it instead of typing on LibreOffice or Firefox. I did some explorations, but I didn’t find anything that I could use.

    Is there any speech recognition software on GNU/Linux which will work with a wav2vec2 model?

  • thervingi@lemmy.ml
    link
    fedilink
    arrow-up
    1
    ·
    2 years ago

    There should be lots of pytorch based speec recognition models now (works on python).