
#vision-and-language-pre-training

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook
5196
8 months ago

Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)

Python
24
1 year ago

Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations

Jupyter Notebook
14
3 years ago

Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)

12
3 years ago

The official implementation for the ICCV 2023 paper "Grounded Image Text Matching with Mismatched Relation Reasoning".

Python
6
1 year ago