Repository navigation

#

open-vocabulary-segmentation

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook
16801
1 年前

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

Jupyter Notebook
8220
5 天前

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Python
1684
7 个月前

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python
919
1 年前

Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

Jupyter Notebook
422
1 年前

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python
362
5 个月前

[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation

Python
120
2 年前

[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

Python
106
1 个月前

[ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"

Python
65
12 天前

[ICLR 2025] Official code of "Segment any 3D Object with Language"

Python
50
2 个月前

Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"

Python
40
5 个月前

(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

Python
27
10 个月前

FLOSS: Plug-in Training-free and label-free text template selection that boosts OVSS methods (ICCV 2025)

Python
25
1 个月前

[AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.

Python
24
8 个月前

[IEEE TCSVT24] OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding

Python
13
2 个月前

(SegmentedOWLv2) is a powerful command-line tool for text-prompted object segmentation for video and images.

Jupyter Notebook
6
23 天前