Transformer Redesign for Late Fusion of Audio-Text Features on Ultra-Low-Power Edge Hardware
This research introduces a late-fusion transformer architecture optimized for ultra-low-power edge hardware, achieving real-time multimodal emotion recogniti...