Repository navigation

#

cross-modality

a state-of-the-art-level open visual language model | 多模态预训练模型

Python
6483
1 年前

TOMM2020 Dual-Path Convolutional Image-Text Embedding with Instance Loss 🐾 https://arxiv.org/abs/1711.05535

MATLAB
290
3 个月前

Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.

268
9 个月前

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

Python
152
2 年前

[CVPR 2023] Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification

Python
121
1 年前

PyTorch implementation of the paper "Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval", CVPR 2019.

Python
110
2 年前

[CVPR-2024] The First High Definition (HD) Event based Visual Object Tracking Benchmark Dataset

Python
109
1 个月前

Co-Separating Sounds of Visual Objects (ICCV 2019)

Python
94
2 年前

Demo code for visible thermal (cross-modality) person re-identification

Python
90
6 年前

CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)

Python
48
4 年前

[CVPR2024]Day-Night Cross-domain Vehicle Re-identification

Python
38
6 个月前

A New Strong and Simple Baseline Method for VI-ReID (Bridging the Gap: Multi-level Cross-modality Joint Alignment for Visible-infrared Person Re-identification)

Python
37
1 年前

Pytorch code for Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification

Python
34
9 个月前

[ICME 2024] VRHCF: Cross-Source Point Cloud Registration via Voxel Representation and Hierarchical Correspondence Filtering

Python
28
1 年前