LLM Reasoning with Process Rewards for Outcome-Guided Steps

This research introduces PROGRS, a novel framework leveraging Process Reward Models to overcome sparse feedback limitations in LLM mathematical reasoning. By...

Level: advanced

By Mohammad Rezaei

Category: research