Repository navigation

#

video-generation

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python
11846
2 个月前

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python
10891
13 天前

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python
4935
1 年前

A curated list of recent diffusion models for video generation, editing, and various other applications.

4906
9 小时前

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python
4230
1 年前

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python
4206
2 年前

Wan: Open and Advanced Large-Scale Video Generative Models

Python
3916
16 天前
SandAI-org/MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python
3453
2 个月前

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

Python
3216
1 年前

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python
3124
3 个月前

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python
3036
8 个月前

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python
2922
1 年前

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python
2766
1 年前

A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

Python
2502
6 个月前

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python
2434
22 天前

Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

Cuda
2238
15 天前

A unified inference and post-training framework for accelerated video generation.

Python
2018
1 分钟前

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Python
1757
2 年前

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

Python
1538
2 个月前