PermLLM introduces learnable channel permutation to optimize N:M sparse large language models, reducing computational complexity while preserving critical we...
Level: advanced
By Unknown
Category: research