# zero-shot

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python
2874
1 year ago

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python
2653
1 month ago

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python
1730
1 month ago

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

Python
1678
3 months ago

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch

Python
1318
2 years ago

Real-time and accurate open-vocabulary end-to-end object detection

Python
1313
4 months ago

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Python
1250
1 year ago

The online version is temporarily unavailable because we cannot afford the key. You can clone the repository and run it locally. Note: we set a default OpenAI key. If the key exceeds its plan or becomes invalid, please tell us. Response speed depends on OpenAI (sometimes the official service is crowded and slow).

Python
815
1 year ago

[ICLR 2024🔥] Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Python
802
1 year ago

Official implementation of "Segment Any Anomaly without Training via Hybrid Prompt Regularization (SAA+)".

Jupyter Notebook
787
2 months ago

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Python
651
3 years ago

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python
641
5 months ago

[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Python
416
4 months ago

PyTorch Implementation of StyleSinger (AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis

Python
370
13 days ago

API for the GPT-J language model 🦜, including a FastAPI backend and a Streamlit frontend

Python
337
3 years ago

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Python
332
2 years ago

PyTorch Implementation of TCSinger (EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control

Python
322
3 days ago

A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.

Python
320
2 months ago