Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models
This research introduces Sommelier, an open-source pipeline designed to address data scarcity in full-duplex speech language models by effectively handl...