Sunday, September 7, 2025

Psychological Tricks Can Get AI to Break the Rules

Researchers convinced large language model chatbots to comply with “forbidden” requests using a variety of conversational tactics.

from WIRED https://ift.tt/eFLx02y

No comments:

Post a Comment