DiTSinger: Scaling Singing Voice Synthesis with Diffusion Transformer and Implicit Alignment

DiTSinger introduces a novel Diffusion Transformer architecture for scalable singing voice synthesis, leveraging implicit alignment and LLM-generated lyrics ...

Level: advanced

By Unknown

Category: research