Policy-Invisible Violations in LLM-Based Agents

Explore the critical challenge of policy-invisible violations in LLM agents and the Sentinel framework's use of counterfactual graph simulation to ensure rob...

Level: advanced

By Jie Wu, Ming Gong

Category: discussion