ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models

This research introduces ETA-VLA, a novel framework that significantly reduces the computational cost of Vision-Language-Action models through dynamic token ...

Level: advanced

By Yiru Wang

Category: research