#data-generation

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook
16970
1 year ago

A list of useful data augmentation resources. Here you will find some less common techniques, libraries, links to GitHub repos, papers, and more.

1642
1 year ago

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.

Jupyter Notebook
1583
2 days ago
sdv-dev/CTGAN
Python
1465
3 days ago

Data generation and property-based testing for Elixir. 🔮

Elixir
918
16 days ago

Generate strings that match a given regular expression

Ruby
523
1 year ago

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated/synthetic data sets for tests, POCs, and other uses in Databricks environments, including in Delta Live Tables pipelines.

Python
429
12 days ago

C++ Faker library for generating fake (but realistic) data.

C++
387
18 days ago

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Python
381
5 days ago

Genalog is an open-source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.

Jupyter Notebook
334
2 years ago

A novel approach for synthesizing tabular data using pretrained large language models

Python
323
3 months ago

Generate random data sets

R
256
3 years ago