• NutWrench@lemmy.ml
    link
    fedilink
    arrow-up
    42
    arrow-down
    2
    ·
    1 day ago

    Ever wonder why these captchas are always cars, bicycles, motorcycles, traffic lights and crosswalks? Because YOU are doing the work of teaching the next generation of AI for self-driving cars.

    • yetAnotherUser@discuss.tchncs.de
      link
      fedilink
      arrow-up
      15
      ·
      23 hours ago

      I don’t believe it, at least not anymore.

      Google has had more than enough data to train AI models from reCAPTCHA for many years. In 2010 it displayed 100 million captchas per day. You simply do not need hundreds of billions of solved captchas in your data set.

      I feel like its only purpose nowadays is stopping basic bots and annoying people who don’t let themselves be tracked as much as advertisers would like.

      • SSJMarx@lemm.ee
        link
        fedilink
        English
        arrow-up
        5
        ·
        22 hours ago

        Yeah the most recent version of CAPTCHA is completely seamless for the end user because there is no more value to be had gathering this kind of data. Instead it runs in the background of the web site, looking at your mouse movements/clicks/keystrokes, and determining whether or not you’re a bot based on that information.

        The problem is a lot of websites still use the old version, or their own hacked together CAPTCHA alternative, which decent bots have been able to beat for a while now.

    • Squorlple@lemmy.world
      link
      fedilink
      English
      arrow-up
      17
      ·
      24 hours ago

      Can’t wait until we get trolley problem CAPTCHAs and we have to choose the square with the most expendable human lives

    • TheOakTree@lemm.ee
      link
      fedilink
      arrow-up
      9
      ·
      1 day ago

      My favorite is when it asks me to identify stairs. I just imagine a self-driving car mistaking a set of stairs as more road and deciding to try and climb the steps.

      • Daemon Silverstein@thelemmy.club
        link
        fedilink
        arrow-up
        5
        ·
        24 hours ago

        Actually, it’s training a self-driving humanoid robot that’s supposed to climb stairs in order to terminate any potential John Connor that’s inside a house upstairs.

      • gamermanh@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        5
        ·
        23 hours ago

        You and many other humans are doing verification work

        It’s pretty sure it’s already right, but if enough people get the same image and get it wrong the same way then something’s up, flag it

          • gamermanh@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            1
            ·
            23 hours ago

            I took some compsci classes years ago when this tech was new and that’s exactly how it was described as being handled

            Once image recognition software got good enough to be right most of the time they started this shit to help get it the rest of the way to all of the time

            Do it any other way and you have to pay those people

      • chatokun@lemmy.dbzer0.com
        link
        fedilink
        arrow-up
        1
        ·
        23 hours ago

        Theres a CGPGrey video that describes old techniques. It’s not quite up to date on some of its predictions, but it is how some machine learning works. Of course, it doesn’t discuss current proprietary techniques, because those are company secrets. Still, it’s as good a guess we’ll likely get, unless something radically different has been invented:

        https://youtu.be/R9OHn5ZF4Uo

        There is also a second video about more modern stuff, but it’s more a footnote:
        https://youtu.be/wvWpdrfoEv0