ES4R: Speech Encoding Based on Prepositive Affective Modeling for Empathetic Response Generation
ES4R introduces a dual-level affective modeling architecture that explicitly captures emotional states prior to speech encoding, addressing coherence degrada...