When Adaptive Rewards Hurt: Causal Probing and the Switching-Stability Dilemma in LLM-Guided LEO Satellite Scheduling

This research challenges the assumption that adaptive rewards improve LLM-guided satellite scheduling, revealing a critical switching-stability dilemma where...

Level: advanced

By Yuanhang Li

Category: research