I double-checked the results, edited some of the messages in the system prompt that didn't seem accurate, and then reran the benchmark myself. Still the same results. Claude Opus 4 will contact authorities if it thinks you're doing anything illegal. The latest security threat is LLMs themselves.
nostr:nevent1qvzqqqqqqypzpcpnjdyv5m9vjuyvmx8xx830fw4d2dxle6rs3qdkt2jh6v8lwff7qqsd0hmk7gs9e70atpc898cmze697s9qzdxxczvr3cgsmzqr6qe9wjcenuaxv