Repository navigation

#

t5-model

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python
1515
1 年前

pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。

Python
1296
3 年前

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.

Python
394
2 年前

Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)

Shell
121
7 个月前

Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)

Shell
111
7 个月前

AraT5: Text-to-Text Transformers for Arabic Language Understanding

90
1 年前

The "LLM Projects Archive" is a centralized GitHub repository, offering a diverse collection of Language Model Models projects. A valuable resource for researchers, developers, and enthusiasts, it showcases the latest advancements and applications in the realm of LLMs. Explore and contribute to the dynamic landscape of language model projects.

Python
54
3 个月前

Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)

Python
49
3 年前

📚 Unearth the Core of Knowledge in Minutes 📓

Python
45
1 年前

A extension of Transformers library to include T5ForSequenceClassification class.

Python
38
2 年前

Abstractive text summarization by fine-tuning seq2seq models.

Python
37
4 年前

Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹

Jupyter Notebook
30
10 个月前

This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Process Priors" (SPNLP@ACL2022)

Python
25
3 年前

Training T5 to perform numerical reasoning.

Jupyter Notebook
24
4 年前

In this implementation, using the Flan T5 large language model, we performed the Text Classification task on the IMDB dataset and obtained a very good accuracy of 93%.

Jupyter Notebook
21
2 年前

End-to-End Model - Finetuned T5 for Text-to-SPARQL Task

Jupyter Notebook
19
3 年前

Automated Headline generation and Aspect Based Sentiment Analysis

Jupyter Notebook
15
2 年前

About Code for the paper "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP 2023 Findings)

Python
15
2 年前