Repository navigation

#

latent-diffusion

invoke-ai/InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.

TypeScript
25743
2 小时前
Sanster/IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python
22024
4 个月前

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python
5917
1 年前
JoePenna/Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Jupyter Notebook
3229
2 年前

SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)

Jupyter Notebook
2437
14 天前

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python
1324
2 年前

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook
770
1 年前

DiffusionFastForward: a free course and experimental framework for diffusion-based generative models

Jupyter Notebook
659
2 年前

基于Stable Diffusion优化的AI绘画模型。支持输入中英文文本,可生成多种现代艺术风格的高质量图像。| An optimized text-to-image model based on Stable Diffusion. Both Chinese and English text inputs are available to generate images. The model can generate high-quality images in several modern art styles.

655
2 年前

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Python
650
1 年前

The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

Python
460
1 年前

Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation

Python
432
2 个月前