Repository navigation

#

apache-arrow

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

Rust
5230
6 小时前
aws/aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

Python
4045
15 天前

❄️ Coolest database around 🧊 Embeddable column database written in Go.

Go
1449
6 天前

Manipulate JSON-like data with NumPy-like idioms.

Python
899
21 小时前
TypeScript
773
2 个月前

Geospatial extensions for Polars

Rust
734
1 年前

Rust-based WebAssembly bindings to read and write Apache Parquet data

Rust
604
19 小时前

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

C++
603
2 年前

Specification for storing geospatial data in Apache Arrow

480
3 个月前

Apache Kafka® compatible broker with S3, PostgreSQL, Apache Iceberg and Delta Lake

Rust
436
4 小时前

GeoArrow in Rust, Python, and JavaScript (WebAssembly) with vectorized geometry operations

Rust
354
4 天前

Official Julia implementation of Apache Arrow

Julia
293
3 个月前

A Rust DataFrame implementation, built on Apache Arrow

Rust
280
5 年前

A SQLite vtable extension to read Parquet files

C++
271
4 年前

Official Go implementation of Apache Arrow

Assembly
237
4 天前

Fletcher: A framework to integrate FPGA accelerators with Apache Arrow

VHDL
228
9 天前

Manipulate arrays of complex data structures as easily as Numpy.

Python
214
5 年前