Calibrate-Then-Delegate: Safety Monitoring with Risk and Budget Guarantees via Model Cascades

This research introduces Calibrate-Then-Delegate, a novel model cascade that uses Delegation Value probes to optimize LLM safety monitoring. It replaces unce...

Level: advanced

By Edoardo Pona

Category: discussion