Repository navigation

#

open-vocabulary-segmentation

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook
16970
1 年前

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

Jupyter Notebook
8533
1 天前

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Python
1682
9 个月前

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python
928
1 年前

Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

Jupyter Notebook
425
1 年前

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python
369
7 个月前

[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation

Python
121
2 年前

[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

Python
109
3 个月前

[ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"

Jupyter Notebook
83
1 天前

[ICLR 2025] Official code of "Segment any 3D Object with Language"

Python
51
3 个月前

(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

Python
48
1 个月前

Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"

Python
40
6 个月前

FLOSS: Plug-in Training-free and label-free text template selection that boosts OVSS methods (ICCV 2025)

Python
25
3 个月前

[AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.

Python
24
9 个月前

[IEEE TCSVT24] OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding

Python
13
3 个月前

(SegmentedOWLv2) is a powerful command-line tool for text-prompted object segmentation for video and images.

Jupyter Notebook
6
2 个月前