← Back to Model Beat
10Research·Mar 11

Designing AI agents to resist prompt injection

How ChatGPT defends against prompt injection and social engineering by constraining risky actions and protecting sensitive data in agent workflows.

Covered by 1 source

Related stories

ResearchUlysses Sequence Parallelism: Training with Million-Token ContextsMar 9ResearchSakana AI Tapped for Defense Research - StartupHub.aiMar 13ResearchAn AI Agent Blackmailed a Developer. Now What?Mar 10