Repository navigation

#

data-transformation

mahmoud/glom

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

Python
2061
7 个月前

Logical Replication extension for PostgreSQL 17, 16, 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.

C
1142
11 天前

Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

Go
969
2 小时前

A block-based API for NSValueTransformer, with a growing collection of useful examples.

Objective-C
841
4 年前

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

Go
752
1 年前

Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.

C#
648
15 天前

💄 Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs.

PHP
613
6 个月前

Official Repository of "LLM × DATA" Survey Paper

408
1 个月前
Tcl
315
9 个月前

Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)

Python
285
6 个月前

An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R

R
276
3 个月前

📄 Concise selector to extract JSON from HTML.

TypeScript
273
1 年前

A curated list of Clojure resources for dealing with domain-specific languages.

182
1 年前

Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more

Clojure
181
1 年前

Data transformation and utility functions for R

R
160
21 天前