Repository navigation

#

dataset

Faker is a Python package that generates fake data for you.

Python
18269
2 天前
Python
9913
5 个月前
googlecreativelab/quickdraw-dataset

Documentation on how to access and use the Quick, Draw! Dataset.

6357
1 个月前

A powerful tool for creating fine-tuning datasets for LLM

JavaScript
5522
5 天前

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python
5216
3 天前
mdn/browser-compat-data

This repository contains compatibility data for Web technologies as displayed on MDN

JSON
5196
2 天前

esProc SPL is a JVM-based programming language designed for structured data computation, serving as both a data analysis tool and an embedded computing engine.

Java
4634
3 天前

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Python
4392
3 天前

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.

TypeScript
4265
9 个月前

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python
4112
1 年前