Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models

This research introduces expert-choice routing as a superior alternative to token-choice routing in diffusion language models, enabling adaptive computation ...

Level: advanced

By Shuibai Zhang

Category: research