TPP-SD: Accelerating Transformer Point Process Sampling with Speculative Decoding

This paper introduces TPP-SD, a speculative decoding framework for Transformer-based temporal point processes that achieves a 2–6× speedup while maintaining...

Level: advanced

By Shukai Gong and 4 other authors

Category: research