zero-shot-classification

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

Jupyter Notebook
7582
11 days ago

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Python
314
1 year ago

Official code for “OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding”

Python
267
1 month ago

Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)

Jupyter Notebook
164
1 year ago

Official PyTorch Implementation of MSDN (CVPR'22)

Python
53
2 years ago

Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include Zero-shot accuracy, Linear Probe, Image retrieval, and KNN accuracy.

Jupyter Notebook
48
3 months ago

[ICASSP 2025] Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"

Python
48
10 days ago

Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.

Python
39
2 years ago

Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts

Jupyter Notebook
38
8 months ago

Alternate implementation for zero-shot text classification: instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models to do ZSC. Hence, it can be lightweight and support more languages without trading off accuracy. (Super simple, a 10th-grader could totally write this but since no 10th-grader did, I did) - Prithivi Da

Python
37
3 years ago

Code for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"

Jupyter Notebook
36
3 years ago

[CVPR 2024] The official implementation of the paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"

Python
35
1 year ago