microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
microCLIP is an unsupervised adaptation method for CLIP models that uses coarse-fine token fusion and saliency-oriented attention to significa...
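
The summary names coarse-fine token fusion and saliency-oriented attention but does not say how they are combined. The sketch below shows one generic way such a fusion could look; it is not taken from the paper. It assumes the coarse token is the image-level [CLS] embedding, the fine token is a saliency-weighted pool of patch embeddings, and the two are mixed with a weight `alpha`. The function names, `alpha`, and `temperature` are all hypothetical.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def saliency_weights(cls_token, patch_tokens, temperature=0.1):
    """Assumed saliency: softmax over cosine similarity of each patch to [CLS]."""
    cls_n = l2_normalize(cls_token)
    sims = [
        sum(a * b for a, b in zip(cls_n, l2_normalize(p)))
        for p in patch_tokens
    ]
    m = max(sims)  # subtract the max for numerical stability
    exps = [math.exp((s - m) / temperature) for s in sims]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_coarse_fine(cls_token, patch_tokens, alpha=0.5, temperature=0.1):
    """Mix the coarse [CLS] token with a saliency-pooled fine token.

    alpha is a hypothetical mixing weight: 1.0 keeps only the coarse token,
    0.0 keeps only the fine token.
    """
    weights = saliency_weights(cls_token, patch_tokens, temperature)
    dim = len(cls_token)
    # Fine token: saliency-weighted average of the patch embeddings.
    fine = [
        sum(w * p[d] for w, p in zip(weights, patch_tokens))
        for d in range(dim)
    ]
    coarse_n = l2_normalize(cls_token)
    fine_n = l2_normalize(fine)
    fused = [alpha * c + (1 - alpha) * f for c, f in zip(coarse_n, fine_n)]
    return l2_normalize(fused)
```

With toy inputs such as `fuse_coarse_fine([1.0, 0.0, 0.0], [[0.9, 0.1, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])`, the patch most aligned with the [CLS] token receives almost all of the saliency weight, so the fused vector stays close to the coarse direction while picking up a small contribution from that patch.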