Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

Explore the Multimodal Prompt Optimizer (MPO), an approach that moves past the limits of text-only prompt optimization for MLLMs by leveraging alignment-preserving updates and Bayesian ...

Level: advanced

By Unknown

Category: research