This research introduces PRM-guided GFlowNets to enhance mathematical reasoning in LLMs by combining Monte Carlo Tree Search with similarity-based data augme...
Level: advanced
By Unknown
Category: research