Repository navigation

#

diffusion-transformer

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python
9706
1 天前
Python
2183
2 个月前
Python
2004
3 天前

[NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Python
1135
1 个月前

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python
775
2 天前

Implementation of F5-TTS in MLX

Python
520
1 个月前

Taming FLUX for Image Inversion & Editing; HunyuanVideo for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

Python
471
8 天前

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

428
8 天前

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Python
271
1 年前

[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy

Python
245
4 个月前

Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

JavaScript
212
4 天前

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Python
145
5 个月前

The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization"

Python
89
1 个月前

Implementation of F5-TTS in Swift using MLX

Swift
63
4 个月前

ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151

62
6 个月前

Implementation of Diffusion Transformer Model in Pytorch

Python
57
4 个月前

FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.

Python
44
9 个月前

Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.

Python
40
1 个月前

This repo implements Diffusion Transformers(DiT) in PyTorch and provides training and inference code on CelebHQ dataset

Python
28
3 个月前