
open-vocabulary-segmentation


IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

open-vocabulary-detection open-vocabulary-segmentation data-generation automatic-labeling-system caption speech image-editing

Jupyter Notebook

16147

1476

7 months ago

roboflow / notebooks

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

computer-vision deep-learning deep-neural-networks image-classification image-segmentation object-detection yolov5 PyTorch tutorial yolov8 google-colab machine-learning zero-shot-classification open-vocabulary-detection automatic-labeling-system open-vocabulary-segmentation paligemma qwen vlm

Jupyter Notebook

7582

1186

11 days ago

roboflow / awesome-openai-vision-api-experiments

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

ChatGPT computer-vision openai classification clip zero-shot grounding-dino open-vocabulary-detection open-vocabulary-segmentation segment-anything

Python

1678

133

3 months ago

hkchengrex / Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

deep-learning object-tracking open-vocabulary-segmentation video-editing video-object-segmentation video-segmentation open-vocabulary-video-segmentation open-world-video-segmentation iccv2023

Python

1360

131

9 months ago

FoundationVision / GLEE

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

foundation-model object-detection open-world tracking open-vocabulary-detection open-vocabulary-segmentation open-vocabulary-video-segmentation referring-expression-comprehension referring-expression-segmentation video-instance-segmentation video-object-segmentation zero-shot-object-detection referring-video-object-segmentation interactive-segmentation segment-anything

Python

1119

69

6 months ago

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

deep-learning instance-segmentation panoptic-segmentation PyTorch semantic-segmentation diffusion-models text-image-retrieval zero-shot-learning open-vocabulary-segmentation

Python

895

49

9 months ago

SkalskiP / awesome-foundation-and-multimodal-models

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

blip clip foundational-models grounding-dino llava multimodal segment-anything computer-vision natural-language-processing open-vocabulary-detection open-vocabulary-segmentation image-captioning

Python

611

45

1 year ago

segments-ai / panoptic-segment-anything

Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

open-vocabulary-detection open-vocabulary-segmentation segmentation

Jupyter Notebook

404

26

1 year ago

wanghao9610 / OV-DINO

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

open-vocabulary-detection object-detection open-world zero-shot-object-detection open-vocabulary-segmentation

Python

310

19

1 month ago

Kunhao-Liu / 3D-OVS

[NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation

3D nerf open-vocabulary-segmentation

Python

118

5

1 year ago

hustvl / MaskAdapter

[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

clip open-vocabulary-segmentation segment-anything segmentation vision-language-model zero-shot zero-shot-segmentation

Python

85

1

2 months ago

CVRP-SOLE / SOLE

[ICLR 2025] Official code of "Segment any 3D Object with Language"

open-vocabulary-segmentation scannet segment-anything segment-anything-model

Python

43

0

3 months ago

lorebianchi98 / Talk2DINO

Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation"

clip computer-vision open-vocabulary-segmentation

Python

26

2

3 months ago

HVision-NKU / MaskCLIPpp

Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"

open-vocabulary-segmentation vision-language-model clip image-segmentation

Python

23

1

1 month ago

chenxi52 / FrozenSeg

Open-Vocabulary Panoptic Segmentation

clip open-vocabulary-segmentation panoptic-segmentation segment-anything segmentation instance-segmentation multi-modal-learning vision-and-language zero-shot

Python

23

1

7 months ago

clownrat6 / OpenVIS

Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.

open-vocabulary-segmentation open-vocabulary-video-segmentation video-instance-segmentation

Python

22

0

4 months ago

lartpang / OVCamo

(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

camouflaged-object-detection open-vocabulary-detection open-vocabulary-segmentation

Python

22

1

6 months ago

yasserben / FLOSS

FLOSS: Plug-in Training-free and label-free text template selection that boosts OVSS methods

clip open-vocabulary-segmentation semantic-segmentation vision-language-model

Python

17

1

5 days ago

PhucNDA / Open3DSceneUnderstanding

[ICCVW23] VinAI-3DIS Metadata repo of OpenSUN3D

instance-segmentation open-vocabulary-segmentation

Jupyter Notebook

4

0

2 years ago

katsunori-waragai / zed-gsam

grounded-segment-anything with ZED SDK

open-vocabulary-segmentation segment-anything segmentation

Python

0

0

5 months ago