Repository navigation

#

multimodal-learning

An open-source framework for training large multimodal models.

Python
3996
1 年前

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python
2035
1 年前
Eurus-Holmes/Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Python
1368
2 年前

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python
976
1 年前

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Python
954
1 年前

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

OpenEdge ABL
857
2 年前

Papers, code and datasets about deep learning and multi-modal learning for video analysis

810
4 年前
Python
690
2 年前

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Python
605
10 个月前

A curated list of awesome vision and language resources (still under construction... stay tuned!)

545
10 个月前
Python
518
9 天前

Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

Python
513
5 年前

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Python
499
3 个月前