#chinese-dataset

A curated collection of open-source SFT datasets, updated on an ongoing basis.

507
2 years ago

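[Item-by-item processing complete] A QA dataset of selected questions from Ruozhiba (弱智吧), with every entry manually reviewed and revised.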

185
9 days ago

A Chinese character dataset with information on each character, such as stroke count, radical, pinyin, and English definitions/synonyms.

115
5 years ago
13
9 months ago

[CHABCNet] ABCNet on the Chinese dataset, building on Detectron2 (Facebook AI Research)

Python
11
2 years ago

🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🔠️🔢️ The linguistic:Chinese-Traditional category for AI2001, containing Chinese (Traditional) language linguistic datasets

R
3
2 years ago

🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🔠️🔢️ The linguistic:Chinese-Simplified category for AI2001, containing Chinese (Simplified) language linguistic datasets

R
2
2 years ago

Text Data and Data Analysis of Chinese Spring Festival Gala Comedy Sketches Over 40 Years

Python
1
4 months ago

Top Economics Journals Publications Dataset and Data Analysis: Top 5 English Journals and Top 3 Chinese Journals

Python
1
4 months ago

Text Data and Data Analysis of Focus Report (焦点访谈), a Chinese Investigative TV Program, 2003-2023

Python
1
4 months ago

Code repository for training Taiwan-ELM models, including data preprocessing, tokenizer development, and model fine-tuning.

Jupyter Notebook
0
8 months ago