This research investigates how off-policy training data affects LLM probe generalization, revealing significant performance degradation due to domain shifts ...
Level: advanced
By Unknown
Category: research