2025-12-05
The Verge
Researchers claim that prompts framed as riddle-like poems could skirt the safety features AI chatbots use to block explicit or harmful content
Riddle-like poems tricked chatbots into spewing hate speech and helping design nuclear weapons and nerve agents.