Diffusion-Link: Diffusion Probabilistic Model for Bridging the Audio-Text Modality Gap

Diffusion-Link introduces a lightweight diffusion module that bridges the audio-text modality gap, using residual MLPs to map embeddings and enhance encoder-L...

Level: advanced

By Unknown

Category: research
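
The summary mentions residual MLPs that map embeddings. As a rough illustration of that building block only, here is a minimal NumPy sketch of stacked residual MLP blocks applied to a batch of embedding vectors. Everything in it is an assumption made for illustration: the function names `residual_mlp_block` and `make_params`, the ReLU activation, the toy dimensions, the choice of three blocks, and the random weights are hypothetical and are not taken from the Diffusion-Link paper. The sketch also omits the diffusion-specific parts (noise schedule, timestep conditioning, training objective).

```python
import numpy as np

def residual_mlp_block(x, w1, b1, w2, b2):
    """One residual MLP block: x + relu(x @ w1 + b1) @ w2 + b2."""
    hidden = np.maximum(x @ w1 + b1, 0.0)  # ReLU activation (an assumption)
    return x + hidden @ w2 + b2            # skip connection preserves the input dimension

def make_params(dim, hidden_dim, rng):
    """Small random weights for one block (hypothetical initialisation)."""
    w1 = rng.normal(0.0, 0.02, size=(dim, hidden_dim))
    b1 = np.zeros(hidden_dim)
    w2 = rng.normal(0.0, 0.02, size=(hidden_dim, dim))
    b2 = np.zeros(dim)
    return w1, b1, w2, b2

rng = np.random.default_rng(0)
dim, hidden_dim = 8, 16                      # toy sizes, chosen arbitrarily
blocks = [make_params(dim, hidden_dim, rng) for _ in range(3)]

audio_embedding = rng.normal(size=(2, dim))  # stand-in for a batch of two audio embeddings
mapped = audio_embedding
for params in blocks:
    mapped = residual_mlp_block(mapped, *params)

print(mapped.shape)  # (2, 8)
```

Because each block adds its output to its input, the mapped vectors stay in the same embedding dimension as the originals, which is what lets such a module sit between an existing encoder and a downstream consumer without changing either interface.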