apache-arrow

Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch, with more integrations coming.

Rust
5449
1 day ago
aws/aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

Python
4065
20 days ago

❄️ Coolest database around 🧊 Embeddable column database written in Go.

Go
1459
6 days ago

Manipulate JSON-like data with NumPy-like idioms.

Python
908
4 days ago
TypeScript
784
3 months ago

Geospatial extensions for Polars

Rust
752
1 year ago

Rust-based WebAssembly bindings to read and write Apache Parquet data

Rust
618
1 day ago

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C99, C++17, Python 3, Java, and GoLang 🗄️

C++
606
2 years ago

Apache Kafka® compatible broker with S3, PostgreSQL, Apache Iceberg and Delta Lake

Rust
497
3 days ago

Specification for storing geospatial data in Apache Arrow

491
4 months ago

GeoArrow in Rust, Python, and JavaScript (WebAssembly) with vectorized geometry operations

Rust
364
9 days ago

Official Julia implementation of Apache Arrow

Julia
294
16 days ago

A Rust DataFrame implementation, built on Apache Arrow

Rust
280
5 years ago

A SQLite vtable extension to read Parquet files

C++
270
4 years ago

Official Go implementation of Apache Arrow

Assembly
258
16 hours ago

Fletcher: A framework to integrate FPGA accelerators with Apache Arrow

VHDL
228
2 months ago

ParquetSharp is a .NET library for reading and writing Apache Parquet files.

C#
218
1 day ago