January 2026
Why
If you want an AI agent to follow some policy, you can currently do no better than to tell it your policy and hope for the best. The problem with this is that the agent behaves stochastically and is prone to prompt injection attacks, so its stated promise to follow your rules is little more than empty words. The agent lacks integrity.
Read more