attention-is-all-you-need

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python
9141
1 year ago

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python
4358
2 years ago
gordicaleksa/pytorch-original-transformer

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.

Jupyter Notebook
1024
4 years ago
Jupyter Notebook
895
10 months ago

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python
786
2 years ago

A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need

Python
712
4 years ago

Implementation of plug-and-play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python
704
1 year ago

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python
619
2 years ago

A PyTorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Python
556
5 years ago

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python
475
2 days ago

An open-source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", a multimodal model that uses just a decoder to generate both text and images

Python
360
1 year ago

Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"

Python
337
16 days ago

[CVPR 2024] Official implementation of the paper "Inversion-Free Image Editing with Natural Language"

Python
327
1 year ago