This study evaluates AI models' abstract reasoning capabilities across modalities using the ConceptARC benchmark, revealing critical gaps in current accuracy...
Level: advanced
By Unknown
Category: research