Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
This research critiques how uncertainty estimation methods for LLMs are currently evaluated, proposing structured tasks and out-of-distribution detection to mitigate hallucinati...