Repository navigation

#

text-image-retrieval

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python
928
1 年前

New generation of CLIP with fine grained discrimination capability, ICML2025

Python
306
6 天前

The back-end of cross-modal retrieval system,wihch will contain services such as semantic location .etc

Python
66
3 年前

PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k

Python
12
2 年前

[ACMMM'25] Referring Expression Instance Retrieval and A Strong End-to-End Baseline

6
3 个月前

Image-Centered Pseudo Label Generation for Weakly Supervised Text-based Person Re-Identification, PRCV 2024

Python
5
1 年前

The LLM-Powered Video Search System is an advanced multimodal video search solution that leverages Large Language Models (LLMs) to enhance video retrieval through text, image, and metadata queries.

Jupyter Notebook
4
4 个月前

PIMA - A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning

Jupyter Notebook
3
3 年前

A search engine, operating on the foundation of the OpenAI Clip Model to retrieve images corresponding to textual queries.

Jupyter Notebook
1
2 年前

Digimon Dataset for MultiModal Machine Learning

Python
0
2 年前

VisAlign: Aligning Visual Representations with Textual Semantics for Image Similarity and Retrieval

Jupyter Notebook
0
5 个月前

PICTOPEDIA – An interactive word-search chatbot powered by the Wikipedia API. Search terms, get instant info, and chat in real time.

JavaScript
0
9 天前