Repository navigation

#

t5-model

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python
1582
1 年前

pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。

Python
1300
3 年前

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.

Python
397
2 年前

Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)

Shell
129
1 年前

Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)

Shell
129
1 年前

AraT5: Text-to-Text Transformers for Arabic Language Understanding

91
1 年前

The "LLM Projects Archive" is a centralized GitHub repository, offering a diverse collection of Language Model Models projects. A valuable resource for researchers, developers, and enthusiasts, it showcases the latest advancements and applications in the realm of LLMs. Explore and contribute to the dynamic landscape of language model projects.

Python
63
7 个月前

Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)

Python
49
3 年前

📚 Unearth the Core of Knowledge in Minutes 📓

Python
45
1 年前

[Pytorch] Unofficial Implementation of "Recommender Systems with Generative Retrieval"

Python
45
1 个月前

A extension of Transformers library to include T5ForSequenceClassification class.

Python
39
2 年前

Abstractive text summarization by fine-tuning seq2seq models.

Python
38
5 年前

Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹

Jupyter Notebook
30
1 年前

This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Process Priors" (SPNLP@ACL2022)

Python
25
3 年前

Training T5 to perform numerical reasoning.

Jupyter Notebook
24
4 年前

Repository about small code models

Python
23
2 个月前

In this implementation, using the Flan T5 large language model, we performed the Text Classification task on the IMDB dataset and obtained a very good accuracy of 93%.

Jupyter Notebook
21
2 年前

End-to-End Model - Finetuned T5 for Text-to-SPARQL Task

Jupyter Notebook
19
4 年前