Repository navigation

#

dit

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python
4443
31 分钟前

🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Python
2579
1 个月前

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!

Python
1894
3 个月前

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

Python
715
3 个月前

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

Python
687
12 小时前

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

Python
616
5 个月前

📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉

Python
376
1 天前

MoH: Multi-Head Attention as Mixture-of-Head Attention

Python
271
10 个月前
Python
258
1 个月前

All-round Creator and Editor

Python
233
7 个月前

Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"

Python
159
10 天前

NMCN(Niche Multi Channel Network),小眾多頻道網絡,是「同和新媒體矩陣」創始團隊於輿論資本全球化背景下率先提出的一種非營利性的去中心化自媒體聯盟形式,通過聯盟內創作單位的交流互推、共享資源等方式對抗資本侵蝕,在產出卓越作品的同時保障亞文化生存空間,為守護寶貴的非物質文化遺產盡綿薄之力。

Java
148
2 年前

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

Python
121
3 个月前

CogVideoX-5B 4-bit quantization model

Python
106
10 个月前

HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.

Python
105
1 个月前

OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions

74
25 天前