captioning-videos

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python
3214
3 months ago

[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)

Python
67
5 years ago

[CVPR 2023] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation

Python
61
2 months ago

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

Python
59
4 years ago

PyTorch Implementation of Consensus-based Sequence Training for Video Captioning

Python
59
7 years ago

PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision

Python
44
5 years ago

A tool for downloading from public image boards (which allow scraping), previewing your images and tags, and editing your images and tags. Additional tabs for downloading other desired code repositories as well as S.O.T.A. diffusion and auto-tag/caption models for your purposes. Custom datasets can be added!

Python
37
3 months ago

Transcription and annotation interface for recorded audio or video files

JavaScript
33
2 days ago

Video to Language Challenge (MSR-VTT Challenge 2016)

Jupyter Notebook
31
7 years ago

An image and video description generator using a CNN-RNN-based architecture.

Jupyter Notebook
23
9 months ago

M-VAD Names Dataset. Multimedia Tools and Applications (2019)

Python
20
6 years ago

Sample app to add captions to an uploaded video. From api.video (https://api.video)

JavaScript
11
2 years ago

Video Search using Natural Language

Python
3
7 years ago

Generate TikTok- and Instagram-tailored captions and hashtags for your videos using the power of some super creative robots up in the clouds ☁️ 🤖 💬 ☁️

Python
3
10 months ago

Official PyTorch Implementation of 'LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport' (ICASSP2025)

Python
3
6 days ago

A multilingual automatic speech recognition and video captioning tool using faster-whisper. Supports real-time translation to English. Runs on a consumer-grade CPU.

HTML
2
5 months ago

Automated Wistia video captioning tool

Python
0
2 years ago