Learn about Vision-language models, Vision-language encoders and various computer vision applications:
The workshop slides to follow.
- Follow the instructions to setup your environment for the workshop.
- Vision-language encoders (CLIP, CLIPSeg, Florence) Instructions
- Set up VLLM's as Python interface. Instructions