Repository navigation

#

dask

Parallel computing with task scheduling

Python
13132
3 天前

STUMPY is a powerful and scalable Python library for modern time series analysis

Python
3895
12 天前
pydata/xarray

N-D labeled arrays and datasets in Python

Python
3766
12 小时前

Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

Python
2722
1 年前

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

Python
2601
1 年前

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

Python
2071
21 天前

A distributed task scheduler for Dask

Python
1625
7 小时前
Python
1141
2 个月前

Python package for earth-observing satellite data processing

Python
1099
8 天前

Scalable machine 🤖 learning for time series forecasting.

Python
1009
19 天前

Lightweight and extensible compatibility layer between dataframe libraries!

Python
931
2 小时前

Fast data store for Pandas time-series data

Python
576
9 个月前

Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

Python
548
2 天前

Distributed SQL Engine in Python using Dask

Python
401
8 个月前
Python
364
12 天前

Library of derived climate variables, ie climate indicators, based on xarray.

Python
354
5 天前