Explore InfoMosaic-Bench, a new benchmark evaluating how tool-augmented agents handle multi-source information seeking and the critical role of domain-specif...
Level: advanced
By Unknown
Category: research