#dit

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python
4007
6 hours ago

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

Python
621
3 days ago

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

Python
577
1 month ago

MoH: Multi-Head Attention as Mixture-of-Head Attention

Python
237
6 months ago

All-round Creator and Editor

Python
213
3 months ago

📖A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. 🎉🎉

210
1 month ago

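NMCN (Niche Multi Channel Network) is a non-profit, decentralized form of independent-media alliance, first proposed by the founding team of 「同和新媒體矩陣」 against the backdrop of the globalization of opinion capital. Through exchange, cross-promotion, and resource sharing among the alliance's creators, it resists erosion by capital, protects the living space of subcultures while producing outstanding work, and makes a modest contribution to safeguarding valuable intangible cultural heritage.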

Java
147
2 years ago

CogVideoX-5B 4-bit quantization model

Python
105
6 months ago

An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!

Python
60
6 days ago

Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"

Python
44
5 days ago

This is the project for "Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation"

30
17 days ago

The interface for dit, a universal container file.

Python
25
1 year ago

UK - Great.gov - Export Opportunities - Find and apply for overseas opportunities from businesses looking for products or services like yours.

HTML
7
3 days ago

A from-scratch, hand-written reimplementation of the diffusion transformer model, built on the MNIST handwritten digit dataset

Python
6
5 months ago

M1 is a research project exploring large-scale music generation using diffusion transformers. This repository contains the implementation of our proposed architecture combining recent advances in diffusion models, transformer architectures, and music processing.

Python
5
6 days ago