Repository navigation

#

data-format

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

Rust
5449
1 天前
Rust
3712
13 天前

High performance data store solution

Scala
1439
3 个月前

CoffeeScript-Object-Notation. Same as JSON but for CoffeeScript objects.

CoffeeScript
1344
1 个月前

Chepy is a python lib/cli equivalent of the awesome CyberChef tool.

Python
1010
18 小时前

Python library for reading and writing well data using Log ASCII Standard (LAS) files

Lasso
370
2 个月前

A super-fast, compact, JSON-equivalent binary data format

C++
323
18 天前

Read and explore NetCDF files

TypeScript
161
2 年前

STON - Smalltalk Object Notation - A lightweight text-based, human-readable data interchange format for class-based object-oriented languages like Smalltalk.

Smalltalk
140
23 天前

EDN parser and generator that works with plain JS data, with support for TS and node streams

TypeScript
112
2 个月前

🎀 Awesome Zarr resources

92
1 年前

Just Data. Save up to 85% network bandwidth and storage.

89
2 年前

Python library for working with OMF files

Jupyter Notebook
78
3 年前

Parse and write environment files with Node.js

TypeScript
62
1 个月前

PEtab - an SBML and TSV based data format for parameter estimation problems in systems biology

60
2 天前

Data format specification schema for the NWB neurophysiology data format

60
2 天前

Python API for geoh5, an open file format for geoscientific data.

Python
58
1 天前

A human readable object notation / serialization format that syntactic similar to Rust and completely supports Serde's data model.

Rust
58
11 天前

A serializer/deserializer for the rhythm game chart format simai.

C#
45
2 个月前