When Adaptive Rewards Hurt: Causal Probing and the Switching-Stability Dilemma in LLM-Guided LEO Satellite Scheduling
This research challenges the assumption that adaptive rewards improve LLM-guided satellite scheduling, revealing a critical switching-stability dilemma where...