• JohnEdwa
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    5 hours ago

    Yeah, out of all the generative AI fields, voice generation at this point is like 95% there in its capability of producing convincing speech even with consumer level tech like ElevenLabs. That last 5% might not even be solvable currently, as it’s those moments it gets the feeling, intonation or pronunciation wrong when the only context you give it is a text input, which is why everything purely automated tends to fall apart quite fast.

    Especially voice cloning - the DRG Cortana Mission Control mod is one of the examples I like to use.