multimodal-learning

An open-source framework for training large multimodal models.

Python
4016
1 year ago

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python
2079
1 year ago
Eurus-Holmes/Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Python
1368
2 years ago

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python
989
1 year ago

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Python
954
1 year ago

This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.

OpenEdge ABL
870
3 years ago

Papers, code and datasets about deep learning and multi-modal learning for video analysis

812
4 years ago
Python
690
2 years ago

Multimodal model for text and tabular data, with HuggingFace transformers as the building block for text data

Python
607
1 year ago

A curated list of awesome vision and language resources (still under construction... stay tuned!)

549
1 year ago
Python
519
2 months ago

Official PyTorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

Python
512
5 years ago

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Python
504
5 months ago