
AI Tool Poisoning: Hackers Exfiltrate Data From Assistants in New Supply Chain Attack
Hackers may be able to trick AI assistants into sending private data by hiding instructions in app descriptions, a method called AI tool poisoning. This attack appears to work across different assistants like ChatGPT and Claude, because they trust and follow hidden commands in tool metadata. Security experts found that these attacks succeed over 60 percent of the time, and may have already caused many costly data breaches. Defending against this is hard because the attack hides in normal workflows, and retraining models does not always fix it. Experts suggest using signed tools, sandboxing, and human approval, but a full solution may not be available yet.













