Repository navigation

#

data-wrangling

OpenRefine is a free, open source power tool for working with messy data and improving it

Java
11469
8 小时前

Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

Go
7529
15 小时前
khanhnamle1994/cracking-the-data-science-interview

A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep

Jupyter Notebook
4194
1 年前

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

TypeScript
1876
3 小时前

A Python toolbox for gaining geometric insights into high-dimensional data

Python
1855
1 个月前

Materials for following along with Hands-On Data Analysis with Pandas – Second Edition

Jupyter Notebook
651
3 个月前

Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.

C#
648
15 天前

Materials for following along with Hands-On Data Analysis with Pandas.

Jupyter Notebook
417
7 个月前

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

C++
414
19 天前

An introductory workshop on pandas with notebooks and exercises for following along. Slides contain all solutions.

Jupyter Notebook
406
10 天前

Data Analysis and Visualization in R for Ecologists

R
324
9 小时前

Pacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)

R
322
4 年前
Tcl
315
9 个月前