Repository navigation

vqgan

Website
Wikipedia

fishaudio / fish-speech

SOTA Open Source TTS

llama transformer tts valle vits vqgan vqvae

Python

22723

1866

1 个月前

sczhou / CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

codebook codeformer face-enhancement face-restoration PyTorch super-resolution vqgan restoration

Python

17394

3618

10 个月前

CasualGANPapers / Make-A-Scene

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

gans Generative Adversarial Network vqgan

Python

336

3 年前

youngsheen / SimVQ

[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

audio Image vqgan

Python

285

8 个月前

RQ-Wu / RIDCP_dehazing

[CVPR 2023] | RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors

深度学习 Image low-level-vision cvpr2023 图像处理 PyTorch vqgan

Python

248

2 年前

chaofengc / FeMaSR

PyTorch codes for "Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors", ACM MM2022 (Oral)

super-resolution vqgan

Python

228

2 年前

hhguo / MSMC-TTS

Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS

深度学习 speech-synthesis tts vocoder Generative Adversarial Network text-to-speech vq-vae vqgan

Python

166

1 年前

mehdidc / feed_forward_vqgan_clip

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

vqgan generative-model text-to-image

Python

137

2 年前

rkhamilton / vqgan-clip-generator

Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.

机器学习图像处理 CUDA vqgan-clip vqgan artificial-neural-networks art

Jupyter Notebook

113

4 年前

pytti-tools / pytti-notebook

Start here

google-colab Generative Adversarial Network generative-art vqgan-clip vqgan style-transfer image-manipulation 动画 image-generation

Jupyter Notebook

110

2 年前

KR-HappyFace / KoDALLE

🇰🇷 Text to Image in Korean

text-to-image vqgan dalle korean PyTorch

Python

4 年前

markweberdev / maskbit

Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"

generative-ai image-generation vae vqgan

Jupyter Notebook

4 个月前

joanrod / ocr-vqgan

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers

dataset 深度学习 image-generation image-reconstruction OCR vqgan

Python

3 年前