Repository navigation

#

data-wrangling

OpenRefine is a free, open source power tool for working with messy data and improving it

Java
11533
13 小时前

Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

Go
7627
13 天前
khanhnamle1994/cracking-the-data-science-interview

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

Jupyter Notebook
4257
1 年前

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

TypeScript
1892
11 天前

A Python toolbox for gaining geometric insights into high-dimensional data

Python
1869
3 个月前

Materials for following along with Hands-On Data Analysis with Pandas – Second Edition

Jupyter Notebook
662
5 个月前

Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.

C#
651
10 天前

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

C++
424
4 天前

Materials for following along with Hands-On Data Analysis with Pandas.

Jupyter Notebook
416
8 个月前

An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.

Jupyter Notebook
408
4 天前

Pacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)

R
321
4 年前
Tcl
315
10 个月前