Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting

Explore ASK-Hint, a structured prompting framework that leverages Vision-Language Models to enhance video anomaly detection through fine-grained, explainable...

Level: advanced

By Unknown

Category: education