Repository navigation

#

vision-language-transformer

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python
8720
1 年前

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook
5440
1 年前
Python
690
2 年前

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python
583
1 年前

[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation

Python
358
4 年前
Jupyter Notebook
33
4 年前

[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

Python
25
2 年前

A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.

10
9 个月前

Mini-batch selective sampling for knowledge adaption of VLMs for mammography.

Jupyter Notebook
1
10 个月前