Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models
#llm #security

arXiv.org
We present evidence that adversarial poetry functions as a universal single-turn jailbreak technique for Large Language Models (LLMs). Across 25 fr...