Repository navigation

#

video-understanding

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python
3303
9 个月前

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Python
2144
1 年前

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python
1679
13 天前

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.

Python
1644
8 个月前

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python
1583
2 年前

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Python
1563
5 年前

Temporal Segment Networks (TSN) in PyTorch

Python
1079
6 年前

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python
939
1 年前
Python
645
6 年前

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python
627
10 个月前
Python
519
2 个月前

A collection of recent video understanding datasets, under construction!

465
7 年前