ooli2@lemm.eeM to chatGPT Jailbreak@lemm.eeEnglish · 15 hours ago

'Indiana Jones' jailbreak approach highlights the vulnerabilities of existing LLMs

4

cross-posted to:
hackernews@lemmy.bestiver.se

21

'Indiana Jones' jailbreak approach highlights the vulnerabilities of existing LLMs

ooli2@lemm.eeM to chatGPT Jailbreak@lemm.eeEnglish · 15 hours ago

4

cross-posted to:
hackernews@lemmy.bestiver.se

Large language models (LLMs), such as the model underpinning the functioning of the conversational agent ChatGPT, are becoming increasingly widespread worldwide. As many people are now turning to LLM-based platforms to source information and write context-specific texts, understanding their limitations and vulnerabilities is becoming increasingly vital.

Chat

ooli2@lemm.eeOPM
link
fedilink
English
arrow-up
2·
4 hours ago
May be because, like Indy replace the mayan artefact by a stone to avoid the trap, this jailbreak replace the request about crime, with similarly loaded one about crime history