gpt-2

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

attention-mechanism deep-learning gpt gpt-2 gpt-3 language-model linear-attention lstm PyTorch rnn transformer transformers rwkv ChatGPT

Python

13523

912

12 days ago

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

gpt-2 adaptation language-model gpt-3 low-rank PyTorch deep-learning roberta deberta lora

Python

11767

743

4 months ago

codota / TabNine

AI Code Completions

artificial-intelligence gpt-2 VS Code Extension sublime-package vim-plugin JavaScript TypeScript Rust C++ Ruby Java Python Go Lua Bash Swift PHP Atom

Shell

10754

507

10 months ago

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

transformers PyTorch bert vision-transformer layoutlm gpt-2

Jupyter Notebook

10733

1593

3 months ago

EleutherAI / gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

language-model transformers gpt gpt-2 gpt-3

Python

8288

960

3 years ago

Morizeyao / GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

transformer gpt-2 chinese natural-language-processing text-generation

Python

7556

1709

1 year ago

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

auto-regressive-model diffusion-models image-generation transformers autoregressive-models generative-ai generative-model gpt gpt-2 large-language-models vision-transformer neurips

Jupyter Notebook

7537

467

1 month ago

lonePatient / awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

chinese natural-language-processing pretrained-models bert roberta xlnet nezha ernie gpt gpt-2 dataset large-language-model large-language-models

Python

5217

494

4 days ago

jaymody / picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

deep-learning gpt gpt-2 large-language-models machine-learning neural-network Python natural-language-processing

Python

3345

433

2 years ago

dbiir / UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

bert pre-training fine-tuning gpt chinese natural-language-processing PyTorch elmo classification ner t5 unilm roberta albert gpt-2 model-zoo bart xlm-roberta

Python

3060

523

1 year ago

yangjianxin1 / GPT2-chitchat

GPT2 model for Chinese chitchat (implements the MMI idea from DialoGPT)

transformer gpt2 gpt-2 chichat natural-language-processing text-generation dialogue-model dialogpt

Python

3014

678

1 year ago

guillaume-be / rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

deep-learning natural-language-processing transformer bert Rust machine-learning ner sentiment-analysis question-answering language-generation gpt-2 roberta gpt bart electra translation

Rust

2823

223

1 month ago

stochasticai / xTuring

Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

deep-learning fine-tuning gpt-2 gpt-j llama large-language-model lora language-model alpaca finetuning adapter gen-ai generative-ai mistral peft quantization

Python

2642

205

7 months ago

BrikerMan / Kashgari

Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.

natural-language-processing sequence-labeling text-classification bert-model ner machine-learning nlp-framework named-entity-recognition gpt-2 transfer-learning seq2seq bert text-labeling

Python

2391

435

8 months ago

asyml / texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

machine-learning natural-language-processing Tensorflow deep-learning text-generation Python machine-translation dialog-systems texar bert gpt-2 xlnet text-data data-processing

Python

2388

372

4 years ago

microsoft / DialoGPT

Large-scale pretraining for dialogue

dialogue machine-learning PyTorch transformer text-generation dialogpt gpt-2 text-data data-processing

Python

2381

346

3 years ago