vqgan

Python
22723
1 month ago

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python
17394
10 months ago

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Python
336
3 years ago

[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python
285
8 months ago

[CVPR 2023] RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors

Python
248
2 years ago

PyTorch codes for "Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors", ACM MM2022 (Oral)

Python
228
2 years ago

Python
166
1 year ago

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

Python
137
2 years ago

Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, documentation, and smooth video creation.

Jupyter Notebook
113
4 years ago

🇰🇷 Text to Image in Korean

Python
84
4 years ago

Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"

Jupyter Notebook
84
4 months ago

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers

Python
81
3 years ago

Towards training VQ-VAE models robustly!

Python
80
1 month ago

Implementation of Binary Latent Diffusion

Python
51
2 years ago

[ICLR 2024] DAEFR: Dual Associated Encoder for Face Restoration

Python
49
10 months ago

VQ-VAE/GAN implementation in pytorch-lightning

Python
45
10 months ago

Fast and controllable text-to-image model.

Python
40
2 years ago