Repository navigation

#

data-cleaning

Python
3756
20 小时前

Jupyter notebook and datasets from the pandas video series

Jupyter Notebook
2178
1 年前
R
1409
4 个月前

An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Ziya, vLLM

Jupyter Notebook
785
6 个月前

Easy to use Python library of customized functions for cleaning and analyzing data.

Python
509
4 个月前

Schema-Inspector is a simple JavaScript object sanitization and validation module.

JavaScript
505
5 个月前

The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.

Python
449
11 天前

Professional data validation for the R environment

R
420
2 个月前

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

C++
401
7 天前

Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

Python
377
3 年前

Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊

Jupyter Notebook
225
6 年前