Explore gSMILE, a gradient-free framework that leverages Wasserstein distance to achieve token-level interpretability in large language models, revealing cri...
Level: advanced
By Zeinab Dehghani, Mohammed Naveed Akram, Koorosh Aslansefat, Adil Khan, Yiannis Papadopoulos
Category: research