hi,

I got the whisper_stt extension running in Oobabooga and it (kinda) works. However, it seems really bad at understanding my speech, and recognition has been spotty at best.

I saw some YouTube tutorials where it seemed to have no problem understanding speech, even when spoken with quite a strong accent, but in my own experience it performs nowhere near as well as shown there.

So: are there things I can do to improve its performance? Or might the YouTube tutorials have been edited to give a false impression, and spotty performance is simply what to expect?

I’m very happy with silero_tts, and if I can get the speech-to-text to work at the same level, I’d be a happy camper already.

Edit: It seems to be a memory problem. I can select several models in the extension interface: tiny, small, base, medium, … If I choose the tiny or small model, it works, but with the poor results I mentioned above. If I select the medium model I get an OOM error, something like: torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 16.00 MiB (GPU 0; 11.99 GiB total capacity; 11.14 GiB already allocated; 0 bytes free; 11.22 GiB reserved in total by PyTorch). It looks to me as if the language model reserves the whole of my 12 GB of VRAM and doesn’t leave any for the extension. Is there a way to tweak that?
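
A quick way to watch where the VRAM goes while the models load (plain nvidia-smi, nothing specific to the webui):

    # print used/total VRAM once per second while the LLM and then Whisper load
    nvidia-smi --query-gpu=memory.used,memory.total --format=csv -l 1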

Edit 2: OK, so if I use a smaller language model (like a 6B model), the medium Whisper model seems to work perfectly fine … so it is probably a memory issue. I have already tried starting with the command-line flag --gpu-memory set to 5, 8, and 10, which doesn’t seem to do anything. Are there other ways to manage memory?
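
For completeness, this is roughly what I’ve been launching with, plus the other memory-related flags I’ve seen mentioned for the webui. I’m not sure they all apply to my loader or version, and the model name is a placeholder, so treat this as a sketch:

    # what I tried: cap the VRAM the language model may claim (value in GiB)
    python server.py --model <your-model> --gpu-memory 10

    # other flags I’ve seen mentioned: spill overflow to system RAM, or load the LLM in 8-bit
    python server.py --model <your-model> --auto-devices --load-in-8bit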

  • Blaed@lemmy.world (mod) · 11 months ago

    You could try reducing your memory overhead by going down to a 3B-parameter model. If you want to avoid that, maybe experiment with different models in both GPTQ and GGML formats?
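
    For the GGML route, a rough sketch of partial GPU offload with the llama.cpp loader (flag names and the filename are illustrative and may differ between webui versions):

        # keep only some layers on the GPU; the rest stays in system RAM,
        # which leaves VRAM free for the medium Whisper model
        python server.py --model llama-7b.ggmlv3.q4_0.bin --n-gpu-layers 20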

    If you’re willing to spend a few dollars an hour, you could drastically increase available memory and compute by running it on a rented GPU through something like vast.ai or runpod.ai. Might be worth exploring for any tests that need extra oomph.

    Given time, I think many of these models will become easier to run as new optimization and runtime methods begin to emerge.