Repository navigation

#

data-preparation

Visual Data Preparation and Transformation. Low-Code Python-based ETL.

TypeScript
1090
12 天前

An open source book to learn data science, data analysis and machine learning, suitable for all ages!

TeX
224
1 年前

🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)

Vue
141
2 年前

【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Python
133
3 年前

Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby

Python
91
4 年前

ABAP unit testing framework, prepare in Excel, reuse in abap code

ABAP
69
7 天前

This repository contains my implementations of the algorithms which MoNuSAC participants could use for data preparation to train their models at ISBI 2020.

Jupyter Notebook
64
4 年前

Go web crawler to scrape documentation sites and convert content to clean Markdown for LLM ingestion (RAG, training data).

Go
55
1 个月前

“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumPy; SciPy; a helping hand from Python's Standard Library.

Jupyter Notebook
46
5 年前

Market Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales

R
43
4 年前

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

37
1 个月前

Data preparation for data science projects.

R
31
2 年前

Foofah: programming-by-example data transformation program synthesizer

CSS
28
7 年前