Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets

This research introduces PRM-guided GFlowNets to enhance mathematical reasoning in LLMs by combining Monte Carlo Tree Search with similarity-based data augme...

Level: advanced

By Unknown

Category: research