MatSciBench: Benchmarking the Reasoning Ability of Large Language Models in Materials Science
MatSciBench introduces a comprehensive benchmark evaluating the reasoning capabilities of Large Language Models within the complex domain of materials scienc...