COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences

Explore COMAL, a novel meta-algorithm that leverages game theory to align Large Language Models with diverse human preferences through Nash equilibrium conve...

Level: advanced

By Unknown

Category: discussion