Riddle-like poems tricked chatbots into spewing hate speech and helping design nuclear weapons and nerve agents. It turns out ...
When prompts were presented in poetic rather than prose form, attack success rates increased from 8% to 43%, on average — a ...
Morning Overview on MSN
Poems can trick AI into aiding nuclear weapon guides
Poetic prompts that look harmless to a casual reader are now being used to coax large language models into describing the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results