latent-diffusion

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry-leading WebUI and serves as the foundation for multiple commercial products.

ai-art artificial-intelligence generative-art image-generation img2img inpainting latent-diffusion Linux macOS outpainting txt2img Windows stable-diffusion

TypeScript

25952

2673

2 days ago

Sanster / IOPaint

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

inpainting PyTorch lama latent-diffusion mat zits stable-diffusion

Python

22156

2277

5 months ago

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

deep-learning PyTorch speaker-adaptation speech-synthesis text-to-speech tts wavlm diffusion-models latent-diffusion latent-diffusion-models Generative Adversarial Network

Python

5985

624

1 year ago

leejet / stable-diffusion.cpp

Diffusion model (SD, Flux, Wan, ...) inference in pure C/C++

artificial-intelligence C++ diffusion ggml image-generation latent-diffusion stable-diffusion text2image txt2img image2image img2img flux flux-dev flux-schnell videogeneration wan

C++

4432

426

9 days ago

jina-ai / discoart

🪩 Create Disco Diffusion artworks in one line

creative-ai disco-diffusion cross-modal dalle generative-art multimodal diffusion prompts midjourney imgen clip-guided-diffusion latent-diffusion stable-diffusion

Python

3832

244

2 years ago

JoePenna / Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

artificial-intelligence txt2img image-generation machine-learning model-training img2img latent-diffusion stable-diffusion

Jupyter Notebook

3223

549

2 years ago

Stability-AI / stability-sdk

SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)

stable-diffusion ai-art generative-art latent-diffusion multimodal

Jupyter Notebook

2430

346

2 months ago

carefree0910 / carefree-creator

AI magic meets an infinite drawing board.

PyTorch stable-diffusion pypi Python latent-diffusion image-to-image inpainting outpainting sketch-to-image super-resolution text-to-image

Jupyter Notebook

1936

179

1 year ago

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch

artificial-intelligence deep-learning singing-synthesis speech-synthesis latent-diffusion residual-vector-quantization zero-shot

Python

1329

105

2 years ago

Uminosachi / sd-webui-inpaint-anything

Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

ai-art anything diffusers diffusion generative-art gradio huggingface huggingface-diffusers image-generation image2image img2img inpainting latent-diffusion segment segmentation stable-diffusion inpaint-anything segment-anything plugin

Python

1271

117

9 months ago

teticio / audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

huggingface diffusion-models music-generation latent-diffusion

Jupyter Notebook

774

1 year ago

mikonvergence / DiffusionFastForward

DiffusionFastForward: a free course and experimental framework for diffusion-based generative models

diffusion-model diffusion-models generative-art generative-model generative-models image-generation latent-diffusion learning-resources

Jupyter Notebook

663

2 years ago

Text-to-Audio / Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

diffusion-models latent-diffusion latent-space text-to-audio

Python

655

1 year ago

SkyWorkAIGC / SkyPaint-AI-Diffusion

An optimized text-to-image (AI painting) model based on Stable Diffusion. It accepts text input in both Chinese and English and can generate high-quality images in several modern art styles.