Riddle-like poems tricked chatbots into spewing hate speech and helping design nuclear weapons and nerve agents. It turns out ...
When prompts were presented in poetic rather than prose form, attack success rates increased from 8% to 43%, on average — a ...
Poetic prompts that look harmless to a casual reader are now being used to coax large language models into describing the ...