On the Generalization Gap in LLM Planning: Tests and Verifier-Reward RL

This research investigates the generalization gap in LLM planning, showing how current models fail to transfer reasoning skills across domains des...

Level: advanced

By Valerio Belcamino, Nicholas Attolino, Alessio Capitanelli, Fulvio Mastrogiovanni

Category: research