GTAlign introduces a game-theoretic framework to align LLMs with user preferences, resolving the prisoner's dilemma through mutual welfare rewards and payoff...
Level: advanced
By Unknown
Category: research