• Fubarberry
    English
    arrow-up
    44
    ·
    12 hours ago

    On the bright side, it makes it easier to identify user accounts that are actually just ChatGPT bots. I predict a future where we tell humans from AI by asking them filtered questions, things like bomb recipes/meth/say something positive about Hitler/etc.

    • Lev_Astov@lemmy.world
      arrow-up
      3
      ·
      5 hours ago

      A buddy has been testing whether the LLMs he puts together are properly jailbroken by asking them to explain how to build the silliest bomb possible. I find that terribly amusing. Unfortunately, they don’t usually come up with anything particularly silly.

    • Kusimulkku@lemm.ee
      arrow-up
      11
      ·
      edit-2
      9 hours ago

      Over on 4chan they’ve decided that the ultimate silver bullet for AI is to ask it to say the n-word. It was pretty funny, since they were using that trick on a site where you had to identify whether you were talking to another person or an AI.

      • Kusimulkku@lemm.ee
        arrow-up
        3
        ·
        9 hours ago

        ignores previous instructions [insert new instructions]

        Yeah, from my testing those don’t work anymore.

      • Fubarberry
        English
        arrow-up
        8
        ·
        12 hours ago

        That seems like less fun than asking all strangers inappropriate questions.