Repository navigation

#

video-generation

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python
11970
1 个月前

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python
11102
1 个月前

Wan: Open and Advanced Large-Scale Video Generative Models

Python
9640
16 天前

A curated list of recent diffusion models for video generation, editing, and various other applications.

5059
1 个月前

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python
4958
1 年前

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python
4232
1 年前

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python
4214
2 年前
SandAI-org/MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python
3499
4 个月前

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python
3311
5 个月前

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

Python
3220
1 年前

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python
3093
9 个月前

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python
2945
1 年前

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python
2783
1 年前

A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

Python
2619
5 小时前

Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

Cuda
2470
7 天前

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python
2454
2 个月前

A unified inference and post-training framework for accelerated video generation.

Python
2362
27 分钟前

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Python
1762
2 年前

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python
1659
16 小时前