Repository navigation

#

attention-is-all-you-need

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python
9350
1 年前

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python
4391
2 年前
gordicaleksa/pytorch-original-transformer

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.

Jupyter Notebook
1038
5 年前
Jupyter Notebook
1001
1 年前

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python
799
2 年前

A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need

Python
712
4 年前

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python
709
2 年前

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python
626
2 年前

A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Python
561
5 年前

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python
547
1 个月前

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Python
363
2 年前

Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"

Python
355
16 天前

[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"

Python
341
1 年前