Status Card Hours Zero Image

PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts

Vision-language models like CLIP are widely used in zero-shot image classification due to their ability to understand various visual concepts and natural language descriptions. However, how to fully ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

Trending now