ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models
This research introduces ETA-VLA, a novel framework that significantly reduces the computational cost of Vision-Language-Action models through dynamic token ...