MatSciBench: Benchmarking the Reasoning Ability of Large Language Models in Materials Science

MatSciBench introduces a comprehensive benchmark evaluating the reasoning capabilities of Large Language Models within the complex domain of materials scienc...

Level: advanced

By Unknown

Category: research