#text-to-image-generation

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python
4442
1 month ago

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Python
1963
2 years ago

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python
1402
2 months ago

[AAAI 2025] 👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.

Python
1276
3 months ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python
688
19 days ago

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jupyter Notebook
446
4 months ago

Python
374
6 months ago

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python
371
12 days ago

[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

Python
242
4 months ago

Faster generation with text-to-image diffusion models.

Python
224
2 months ago

Official repository for "CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models" (ICLR 2025)

Python
222
5 months ago

[NeurIPS-2023] Annual Conference on Neural Information Processing Systems

Python
209
8 months ago

🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models" (SLD)

Python
181
1 year ago

🔥🔥🔥 A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.

150
8 months ago