Repository navigation

#

attention-is-all-you-need

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python
9404
1 年前

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python
4416
2 年前
gordicaleksa/pytorch-original-transformer

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.

Jupyter Notebook
1056
5 年前
Jupyter Notebook
1046
1 年前

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python
804
2 年前

A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need

Python
713
4 年前

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python
710
2 年前

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python
634
2 年前

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python
571
16 天前

A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Python
563
5 年前

Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"

Python
366
1 个月前

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Python
361
2 年前

[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"

Python
342
1 年前