Repository navigation
diffusion
- Website
- Wikipedia
Stable Diffusion web UI
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
Using Low-rank adaptation to quickly fine-tune diffusion models.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Stable Diffusion and Flux in pure C/C++
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
🪩 Create Disco Diffusion artworks in one line
Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Core Engine of Singing Voice Conversion & Singing Voice Clone
Kandinsky 2 — multilingual text2image latent diffusion model
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising