Repository navigation

#

pydata

Parallel computing with task scheduling

Python
13507
1 天前
Python
3998
1 个月前
Python
3368
2 年前

Extract data from a wide range of Internet sources into a pandas DataFrame.

Python
3102
6 个月前

A distributed task scheduler for Dask

Python
1651
7 小时前

Clean APIs for data cleaning. Python implementation of R package Janitor

Python
1458
5 天前

A clean, three-column Sphinx theme with Bootstrap for the PyData community

Python
725
18 小时前

Scalable genetics toolkit

Python
263
4 天前

RFC document, tooling and other content related to the array API standard

Python
256
1 个月前

Resources for Advancing into Analytics: From Excel to R and Python by George Mount (O'Reilly Media, 2021)

Jupyter Notebook
222
2 年前

A consistent table management library in python

Python
160
2 年前

Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics

Jupyter Notebook
139
1 个月前

Machine learning with scikit-learn tutorial at PyData Chicago 2016

Jupyter Notebook
128
9 年前

Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020

Jupyter Notebook
124
4 年前

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.

Python
121
9 个月前