GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare

GTAlign introduces a game-theoretic framework to align LLMs with user preferences, resolving the prisoner's dilemma through mutual welfare rewards and payoff...

Level: advanced

By Unknown

Category: research