Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models

This research introduces COUPLE, a framework leveraging structural causal models and counterfactual reasoning to achieve steerable, pluralistic value alignme...

Level: expert

By Hanze Guo, Jing Yao, Xiao Zhou, Xiaoyuan Yi, Xing Xie

Category: discussion