Repository navigation

#

data-generation

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook
16147
7 个月前

List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.

1632
8 个月前

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.

Jupyter Notebook
1527
9 个月前
sdv-dev/CTGAN
Python
1374
2 天前

Data generation and property-based testing for Elixir. 🔮

Elixir
898
22 天前

Generate strings that match a given regular expression

Ruby
521
1 年前

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

Python
398
1 个月前

C++ Faker library for generating fake (but realistic) data.

C++
347
1 个月前

Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

Jupyter Notebook
322
1 年前

A novel approach for synthesizing tabular data using pretrained large language models

Python
310
6 个月前

Generate random data sets

R
256
3 年前

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.

Jupyter Notebook
225
1 个月前