Repository navigation

#

multi-modal-chatgpt

Python
3485
6 个月前

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python
2989
10 个月前