Large language models (LLMs), such as the model underpinning the functioning of the conversational agent ChatGPT, are becoming increasingly widespread worldwide. As many people are now turning to LLM-based platforms to source information and write context-specific texts, understanding their limitations and vulnerabilities is becoming increasingly vital.
May be because, like Indy replace the mayan artefact by a stone to avoid the trap, this jailbreak replace the request about crime, with similarly loaded one about crime history