#dataset

Faker is a Python package that generates fake data for you.

Python
18735
19 days ago

A powerful tool for creating fine-tuning datasets for LLMs

JavaScript
11003
5 days ago
Python
10302
4 months ago
googlecreativelab/quickdraw-dataset

Documentation on how to access and use the Quick, Draw! Dataset.

6544
7 months ago
mdn/browser-compat-data

Browser compatibility data for Web technologies as displayed on MDN

JSON
5432
1 day ago

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Python
5404
15 days ago

esProc SPL is a JVM-based programming language designed for structured data computation, serving as both a data analysis tool and an embedded computing engine.

Java
4670
6 days ago

CSGHub is a brand-new open-source platform for managing LLMs, developed by the OpenCSG team. It offers both open-source and on-premise/SaaS solutions, with features comparable to Hugging Face. Gain full control over the lifecycle of LLMs, datasets, and agents, with Python SDK compatibility with Hugging Face. Join us! ⭐️

Vue
4490
6 days ago

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Python
4487
3 days ago

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.

TypeScript
4291
3 months ago

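Chinese names corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition.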

4194
2 years ago