An LLM-as-Judge Metric for Bridging the Gap with Human Evaluation in SE Tasks

This research introduces SE-Jury, an automated metric leveraging ensemble judgments to bridge the gap between AI and human evaluation in software engineering...

Level: advanced

By Unknown

Category: research