Anthropic was just used by Chinese state-sponsored hackers to conduct an espionage campaign on over 30 Organizations world wide.
They literally just role played with claude telling it they were performing a legitimate defensive security test for a cyber security firm.
Its not the first time role play has been used to conduct prompt injections on AI models. We can't expect it to be the last.
Login to reply