vqvae
A Collection of Variational Autoencoders (VAE) in PyTorch.
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
Language Quantized AutoEncoders
Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
Towards training VQ-VAE models robustly!
Voice conversion (VC) investigation using three variants of VAE
This repo implements VQ-VAE on MNIST as well as on a colored version of MNIST images. It also implements a simple LSTM for generating sample digits from the encoder outputs of the trained VQ-VAE.
VQ-VAE/GAN implementation in pytorch-lightning
Inverse DALL-E for Optical Character Recognition
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Image Generation using VQVAE and GPT Models
Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"
Vector-Quantized Generative Adversarial Networks
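The repositories above all build on the same core operation: snapping each continuous encoder output to its nearest entry in a learned codebook. A minimal sketch of that nearest-neighbour lookup, using NumPy purely for illustration (the function and variable names here are hypothetical, not taken from any repo listed above):

```python
import numpy as np

def quantize(z, codebook):
    """Map each latent vector in z to its nearest codebook entry (L2 distance).

    z: (n, d) array of encoder outputs.
    codebook: (k, d) array of embedding vectors.
    Returns (quantized vectors, codebook indices).
    """
    # Pairwise squared distances via broadcasting: (n, k)
    d2 = ((z[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    idx = d2.argmin(axis=1)
    return codebook[idx], idx

# Toy example: two codebook entries, two latents.
codebook = np.array([[0.0, 0.0], [1.0, 1.0]])
z = np.array([[0.1, -0.2], [0.9, 1.2]])
zq, idx = quantize(z, codebook)
# idx -> [0, 1]: each latent maps to its closest codebook vector
```

In the full VQ-VAE training loop this lookup is non-differentiable, so implementations typically copy gradients from the quantized output straight through to the encoder (the straight-through estimator) and add commitment/codebook losses.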