I read that an attack vector against AI agents is malicious instructions in the content the agent consumes. How come there isn’t an AI equivalent of a virus scan that can detect issues in the content? Or a read but don’t execute prompt/skill? It seems existing security defenses should apply. What about AI stops them? submitted by /u/slartybartvart
Originally posted by u/slartybartvart on r/ArtificialInteligence
You must log in or # to comment.
