Repository navigation

text-image-retrieval

Website
Wikipedia

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

transformers bert 自然语言处理 pretrained-models 深度学习 PyTorch fewshot-learning knowledge-distillation knowledge-pretraining text-image-retrieval text-to-image-synthesis 机器学习 text-classification transfer-learning

Python

2169

257

10 个月前

NVlabs / ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

深度学习 instance-segmentation panoptic-segmentation PyTorch semantic-segmentation diffusion-models text-image-retrieval zero-shot-learning open-vocabulary-segmentation

Python

928

1 年前

360CVGroup / FG-CLIP

New generation of CLIP with fine grained discrimination capability, ICML2025

clip cross-modal-retrieval text-image-retrieval

Python

306

6 天前

xiaoyuan1996 / retrievalSystem

The back-end of cross-modal retrieval system，wihch will contain services such as semantic location .etc

remote-sensing text-image-retrieval

Python

3 年前

BIGBALLON / UME-Search

Toward Universal Multimodal Embedding

image-retrieval image-search large-language-models text-image-retrieval information-retrieval retrieval

Python

2 个月前

KimRass / CLIP

PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k

clip multi-modal zero-shot-classification text-image-retrieval

Python

2 年前

haoxiangzhao12138 / REIR

[ACMMM'25] Referring Expression Instance Retrieval and A Strong End-to-End Baseline

multimodal-deep-learning referring-expression-comprehension text-image-retrieval

3 个月前

LeviWeiZhi / ICPG

Image-Centered Pseudo Label Generation for Weakly Supervised Text-based Person Re-Identification, PRCV 2024

clip text-image-retrieval

Python

1 年前

HTAnh2003 / LLM_Powered_Video_Search

The LLM-Powered Video Search System is an advanced multimodal video search solution that leverages Large Language Models (LLMs) to enhance video retrieval through text, image, and metadata queries.

clip Django Docker faiss multimodal retrieval retrieval-augmented-generation text-image-retrieval yolo

Jupyter Notebook

4 个月前

AIoT-Lab-BKAI / PIMA

PIMA - A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning

深度学习 graph-neural-networks text-image-retrieval

Jupyter Notebook

3 年前

MayssaJaz / Text2Image-Search

A search engine, operating on the foundation of the OpenAI Clip Model to retrieve images corresponding to textual queries.

clip FastAPI open-ai React search-engine text-image-retrieval

Jupyter Notebook

2 年前

lorenzo-stacchio / Digimon_Dataset

Digimon Dataset for MultiModal Machine Learning

clip 深度学习 image-generation text-image-retrieval

Python

2 年前

Chaouki-AI / VisAlign

VisAlign: Aligning Visual Representations with Textual Semantics for Image Similarity and Retrieval

alignment image-retrieval image-to-image-translation text-image-retrieval

Jupyter Notebook

5 个月前

shivareddy2002 / Word-Search-Chatbot-Using-Wikipedia-

PICTOPEDIA – An interactive word-search chatbot powered by the Wikipedia API. Search terms, get instant info, and chat in real time.

聊天机器人 CSS HTML JavaScript text-image-retrieval

JavaScript

9 天前