Repository navigation

#

synthetic-data

Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.

Python
4535
24 天前
nucleuscloud/neosync

Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.

Go
3846
10 小时前
Kiln-AI/Kiln

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python
3390
8 小时前
argilla-io/distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python
2640
5 天前
Java
2467
17 小时前

SDG is a specialized framework designed to generate high-quality structured tabular data.

Python
2344
1 个月前
C++
1985
1 个月前
sdv-dev/CTGAN
Python
1374
2 天前

A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions

Python
1025
15 天前

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Python
767
2 个月前