apache-arrow

Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch, with more integrations coming.

Rust
4494
1 day ago
aws/aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

Python
4005
1 day ago

❄️ Coolest database around 🧊 Embeddable column database written in Go.

Go
1407
4 days ago

Manipulate JSON-like data with NumPy-like idioms.

Python
875
2 days ago
TypeScript
746
12 days ago

Geospatial extensions for Polars

Rust
695
8 months ago

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

C++
591
2 years ago

Rust-based WebAssembly bindings to read and write Apache Parquet data

Rust
571
6 days ago

Specification for storing geospatial data in Apache Arrow

461
13 days ago

GeoArrow in Rust, Python, and JavaScript (WebAssembly) with vectorized geometry operations

Rust
317
11 hours ago

Official Julia implementation of Apache Arrow

Julia
289
9 days ago

A Rust DataFrame implementation, built on Apache Arrow

Rust
281
4 years ago

A SQLite vtable extension to read Parquet files

C++
271
4 years ago

Fletcher: A framework to integrate FPGA accelerators with Apache Arrow

VHDL
225
16 days ago

Manipulate arrays of complex data structures as easily as Numpy.

Python
214
4 years ago

ParquetSharp is a .NET library for reading and writing Apache Parquet files.

C#
200
4 days ago
Python
156
1 year ago