ES4R: Speech Encoding Based on Prepositive Affective Modeling for Empathetic Response Generation

ES4R introduces a dual-level affective modeling architecture that explicitly captures emotional states prior to speech encoding, addressing coherence degrada...

Level: advanced

By Zhuoyue Gao and 6 other authors

Category: research