
#data-generation

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook
16801
1 year ago

List of useful data augmentation resources. Here you will find some less common techniques, libraries, links to GitHub repos, papers, and more.

1644
1 year ago

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.

Jupyter Notebook
1573
1 month ago
sdv-dev/CTGAN
Python
1442
2 days ago

Data generation and property-based testing for Elixir. 🔮

Elixir
912
2 months ago

Generate strings that match a given regular expression

Ruby
522
1 year ago

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for tests, POCs, and other uses in Databricks environments, including in Delta Live Tables pipelines.

Python
420
8 days ago

C++ Faker library for generating fake (but realistic) data.

C++
374
2 days ago

Genalog is an open source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.

Jupyter Notebook
332
2 years ago

A novel approach for synthesizing tabular data using pretrained large language models

Python
321
2 months ago

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Python
308
1 day ago

Generate random data sets

R
258
3 years ago