WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement

Explore WIST, a novel framework leveraging open-web data and self-play mechanisms to significantly enhance domain-targeted reasoning in large language models...

Level: advanced

By Fangyuan Li

Category: research