Repository navigation

#

data-format

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

Rust
4488
31 分钟前
Rust
3536
7 天前

High performance data store solution

Scala
1435
19 天前

CoffeeScript-Object-Notation. Same as JSON but for CoffeeScript objects.

CoffeeScript
1339
1 年前

Chepy is a python lib/cli equivalent of the awesome CyberChef tool.

Python
969
4 天前

Python library for reading and writing well data using Log ASCII Standard (LAS) files

Lasso
357
19 天前

A super-fast, compact, JSON-equivalent binary data format

C++
320
1 天前

Read and explore NetCDF files

TypeScript
155
2 年前

STON - Smalltalk Object Notation - A lightweight text-based, human-readable data interchange format for class-based object-oriented languages like Smalltalk.

Smalltalk
138
1 年前

EDN parser and generator that works with plain JS data, with support for TS and node streams

TypeScript
101
10 个月前

Just Data. Save up to 85% network bandwidth and storage.

89
2 年前

🎀 Awesome Zarr resources

86
10 个月前

Python library for working with OMF files

Jupyter Notebook
78
2 年前

Parse and write environment files with Node.js

TypeScript
62
6 个月前

PEtab - an SBML and TSV based data format for parameter estimation problems in systems biology

61
22 天前

Data format specification schema for the NWB neurophysiology data format

57
23 天前

A human readable object notation / serialization format that syntactic similar to Rust and completely supports Serde's data model.

Rust
53
1 个月前

A serializer/deserializer for the rhythm game chart format simai.

C#
38
1 个月前

Data formats for gamma-ray astronomy

Python
30
2 年前