Repository navigation

cross-modal-learning

Website
Wikipedia

[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

cross-modal-learning cross-modality 深度学习 language-model large-language-models 机器学习 multimodal-deep-learning multimodal-time-series prompt-tuning time-series time-series-analysis time-series-forecasting

Python

2275

394

1 年前

MohamedAfham / CrossPoint

Official implementation of "CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding" (CVPR, 2022)

self-supervised-learning Point cloud cross-modal-learning transfer-learning unsupervised-learning few-shot-learning 深度学习

Python

257

2 年前

whwu95 / Cap4Video

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

cross-modal-learning video-understanding

Python

245

10 个月前

whwu95 / Text4Vis

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

cross-modal-learning transfer-learning video-recognition video-understanding action-recognition

Python

196

1 年前

whwu95 / BIKE

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

action-recognition cross-modal-learning video-recognition video-understanding

Python

153

1 年前

Toytiny / CMFlow

[CVPR 2023 Highlight 💡] Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision

autonomous-driving cross-modal-learning 深度学习 optical-flow

Python

136

2 年前

choyingw / Cross-Modal-Perceptionist

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

3D PyTorch 3dmm biometrics 深度学习机器学习 cross-modal-learning 机器视觉 speech-synthesis speech cvpr cvpr2022

Python

129

10 个月前

RunpeiDong / ACT

[ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?

Point cloud cross-modal-learning representation-learning self-supervised-learning

Python

102

1 年前

WinfredGe / T2S

[IJCAI 2025] Official implementation of "T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models"

cross-modal-learning cross-modality 深度学习 language-model 机器学习 multimodal-deep-learning multimodal-time-series time-series time-series-analysis

Python

1 个月前

mako443 / Text2Pos-CVPR2022

Code, dataset and models for our CVPR 2022 publication "Text2Pos"

PyTorch 深度学习 Localization (l10n)自然语言处理 language-processing cross-modal cross-modal-retrieval cross-modal-learning 机器视觉 cvpr cvpr2022

Python

3 年前

knightyxp / DGL

[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.

cross-modal-learning cross-modal-retrieval parameter-efficient-tuning prompt-tuning

Python

1 年前

GaochangWu / FMF-Benchmark

This is a cross-modal benchmark for industrial anomaly detection.

anomaly-detection anomaly-segmentation cross-modal-learning industrial multimodal transformer vit

Python

2 个月前

frank-chris / ImageTextRetrieval

In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Projection Learning model and study their performance. We also propose a modified Deep Cross-Modal Projection Learning model that uses a different image feature extractor. We evaluate the model’s performance on image-text retrieval on a fashion clothing dataset.

image-text-retrieval cross-modal-retrieval cross-modal-learning PyTorch Tensorflow Flask

Jupyter Notebook

4 年前

ospanbatyr / sample-efficient-multimodality

Code for the "Sample-efficient Integration of New Modalities into Large Language Models" paper

foundation-models multimodal-learning cross-modal-learning

Python

1 个月前

StarMoonWang / SeisMoLLM

Official Pytorch Implementation of SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model

ai4science cross-modal-learning fine-tuning-llm PyTorch

Python

2 个月前

Markin-Wang / CAMANet

[IJBHI 2024] This is the official implementation of CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation accepted to IEEE Journal of Biomedical and Health Informatics (J-BHI), 2023.

cross-modal-learning

Python

5 个月前

verlab / StraightToThePoint_CVPR_2020

Original PyTorch implementation of the code for the paper "Straight to the Point: Fast-forwarding Videos via Reinforcement Learning Using Textual Data" at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020

机器视觉 vision-and-language reinforcement-learning agent video-processing video-analysis multimodal-deep-learning multimodal-learning cross-modal-learning cvpr

Python

4 年前

IGITUGraz / MemoryDependentComputation

Code for Limbacher, T., Özdenizci, O., & Legenstein, R. (2022). Memory-enriched computation and learning in spiking neural networks through Hebbian plasticity. arXiv preprint arXiv:2205.11276.

cross-modal-learning neural-networks Python pythorch question-answering recurrent-neural-networks reinforcement-learning

Python

3 年前

codiceSpaghetti / T4SA-2.0

This project creates the T4SA 2.0 dataset, i.e. a big set of data to train visual models for Sentiment Analysis in the Twitter domain using a cross-modal student-teacher approach.

机器视觉 cross-modal-learning dataset-creation 自然语言处理

Jupyter Notebook

4 个月前

PrithivirajDamodaran / WhatTheFood

An intentionally simple Image to Food cross-modal search. Created by Prithiviraj Damodaran.

cross-modal-retrieval cross-modal cross-modal-learning multimodal

4 年前