Repository navigation

#

pydata

Parallel computing with task scheduling

Python
13132
3 天前

STUMPY is a powerful and scalable Python library for modern time series analysis

Python
3895
12 天前
Python
3351
1 年前

Extract data from a wide range of Internet sources into a pandas DataFrame.

Python
3026
16 天前

A distributed task scheduler for Dask

Python
1625
7 小时前

Clean APIs for data cleaning. Python implementation of R package Janitor

Python
1411
2 天前

A clean, three-column Sphinx theme with Bootstrap for the PyData community

Python
689
10 天前

Scalable genetics toolkit

Python
253
5 天前

RFC document, tooling and other content related to the array API standard

Python
233
16 天前

Resources for Advancing into Analytics: From Excel to R and Python by George Mount (O'Reilly Media, 2021)

Jupyter Notebook
209
1 年前

A consistent table management library in python

Python
159
2 年前

Python library for GraphBLAS: high-performance sparse linear algebra for scalable graph analytics

Jupyter Notebook
132
12 天前

Machine learning with scikit-learn tutorial at PyData Chicago 2016

Jupyter Notebook
128
9 年前

Introduction to Machine Learning with Time Series at PyData Festival Amsterdam 2020

Jupyter Notebook
122
4 年前

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.

Python
120
3 个月前