InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents

Explore InfoMosaic-Bench, a new benchmark evaluating how tool-augmented agents handle multi-source information seeking and the critical role of domain-specif...

Level: advanced

By Unknown

Category: research