Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks

This research introduces Benchmark Profiling, a framework that dissects LLM performance into ten distinct cognitive abilities using gradient-based scoring an...

Level: advanced

By Unknown

Category: research