Guardian-as-an-Advisor: Advancing Next-Generation Guardian Models for Trustworthy LLMs

This research introduces Guardian-as-an-Advisor, a soft-gating pipeline designed to mitigate over-refusal in LLM safety checkers while maintaining compliance...

Level: advanced

By Yue Huang

Category: discussion