Tech behemoth OpenAI has touted its artificial intelligence-powered transcription tool Whisper as having near “human level robustness and accuracy.”

But Whisper has a major flaw: It is prone to making up chunks of text or even entire sentences, according to interviews with more than a dozen software engineers, developers and academic researchers. Those experts said some of the invented text — known in the industry as hallucinations — can include racial commentary, violent rhetoric and even imagined medical treatments.

Experts said that such fabrications are problematic because Whisper is being used in a slew of industries worldwide to translate and transcribe interviews, generate text in popular consumer technologies and create subtitles for videos.

More concerning, they said, is a rush by medical centers to utilize Whisper-based tools to transcribe patients’ consultations with doctors, despite OpenAI’s warnings that the tool should not be used in “high-risk domains.”

  • QuadratureSurfer@lemmy.world
    11 hours ago

    As someone who uses Whisper fairly often, it’s obvious that they’ve trained off of a bunch of YouTube videos.

    Most of the time it’s very accurate, but there have definitely been a few times in long transcription sessions where it will randomly hallucinate that someone is saying “Don’t forget to like and subscribe!” when nothing was said anywhere near that.

    • rustydrd@sh.itjust.works
      8 hours ago

      That’s hilarious. I just love how AI is basically like a 6-year-old who weaves his favorite new expressions into everything without fully understanding what they mean.