Repository navigation

video-synthesis

Website
Wikipedia

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

diffusion-models video-synthesis

Python

3134

271

9 个月前

DmitryRyumin / ICCV-2023-Papers

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Python

954

1 年前

TIGER-AI-Lab / AnyV2V

Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]

深度学习 generative-ai image-editing image-to-video-generation PyTorch video-editing video-synthesis

Jupyter Notebook

623

1 年前

DmitryRyumin / CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

action-recognition autonomous-driving biometrics 机器视觉 cvpr cvpr2023 datasets 深度学习 face-recognition gesture-recognition image-synthesis medical-image-processing multi-modal-learning pattern-recognition segmentation self-supervised-learning video-synthesis cvpr2024

Python

454

1 年前

TIGER-AI-Lab / ConsistI2V

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]

diffusion-models image-to-video-generation video-generation video-synthesis

Python

250

1 年前

guyyariv / TempoTokens

This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

ai-art 深度学习 diffusion-models generative-ai video-synthesis modelscope PyTorch

Python

128

8 个月前

brandon-rezko / HeyGem

HeyGem — Your AI face, made free

deepfake 机器学习 speech-synthesis text-to-video video-synthesis voice-cloning

9 天前

Bomou-AI / Talking-Head

AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.

talking-head lip-sync talking-face talking-heads text-to-video video-synthesis wav2lip talking-head-videos

3 年前

shgaurav1 / DVG

Diverse Video Generation using a Gaussian Process Trigger

FFmpeg gaussian-processes Video video-generation video-generator video-processing video-synthesis convolutional-neural-networks 深度学习深度神经网络机器学习

Python

3 年前

zhengqia / PlugLink_videosyn

视频合成（videosyn）是由PlugLink官方开发的多素材合成插件，主要用于矩阵号发布，解放双手。Video synthesis (videosyn) is a multi-material synthesis plugin developed by PlugLink, mainly used for matrix number publishing, freeing up your hands.

Video video-generation video-synthesis

HTML

1 年前

adrianSRoman / SELDVisualSynth

Generating Diverse Audio-Visual 360º Soundscapes for Sound Event Localization and Detection

audio-visual-learning video-synthesis

Python

2 个月前

vonqo / devola2

devola2 is an open source real-time audio visualizer (video-synth) tool. Purposely designed for live-music performance of B.L.M.D and Even Tide.

audio-visualizer glsl-shaders p5js Three.js webgl video-synthesis

JavaScript

5 个月前

tasinislam21 / FashionFlow

This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.

人工智能 attention-mechanism cross-attention diffusion-models fashion latent-diffusion video-synthesis

Python

2 年前

cskonopka / ofx-supplemental

Collection of openFrameworks video synthesis examples

fragment-shader glsl openframeworks shaders video-synthesis Xcode

Makefile

1 年前

MECLabTUDA / SG2VID

SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis (MICCAI 2025 - ORAL)

diffusion-models scene-graph video-synthesis

Python

1 个月前

TornadoInsight / AI-Text-Video

AI-Text-Video is an open-source project that leverages advanced artificial intelligence and deep learning technologies to automatically convert written text into engaging videos. This tool generates video content—including visuals, animations, and voiceovers—from simple text input, making it ideal for content creators, educators, marketers, and any

人工智能 content-creation 深度学习 generative-ai 机器学习 multimedia Open Source text-to-speech text-to-video video-editing video-synthesis

Jupyter Notebook

8 天前

Ramtin-ma / VideoSynopsis-FGS

机器视觉 object-detection surveillance-systems tracking video-analysis video-synthesis yolov8

Python

8 个月前

Yazdi9 / Nerf-Video-Synthesis

NeRF- Real-time View Synthesis

3D 机器视觉 nerf video-synthesis

Jupyter Notebook

3 年前

madebynanditaaa / lipgans

LipGANs is a text-to-viseme GAN framework that generates realistic mouth movements directly from text, without requiring audio. It maps phonemes → visemes, predicts phoneme durations, and uses per-viseme 3D GANs to synthesize photorealistic frames that can be exported as PNG sequences, GIFs, or MP4 videos.

Web Accessibility (a11y)机器视觉深度学习 Generative Adversarial Network Keras lip-sync Python speech-synthesis Tensorflow video-synthesis

Python

1 个月前

DeboJp / StoryTeller

An automated video storytelling pipeline that turns online articles into narrated clips with custom scripts, titles, and visuals. Combines scraping, summarization, TTS, and video generation—ideal for tech news recaps, AI-powered media channels, or hands-free content creation.

API 自动化大语言模型 prompt-engineering ranking tts video-synthesis webscraping moviepy pil

HTML

1 个月前