Repository navigation
chunking
- Website
- Wikipedia
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Content-Addressable Data Synchronization Tool
An extensible Java framework for building event-driven applications that break up XML and non-XML data into chunks for data integration
A package for parsing PDFs and analyzing their content using LLMs.
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.
A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
a modular multimodal framework for ai applications
webpack 2, react hotloader 3, react router v4, code splitting and more
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
📑 Split Laravel jobs into multiple separate job chunks
An asynchronous event-driven HTTP client based on netty.
Грамматический Словарь Русского Языка (+ английский, японский, etc)
Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in Javascript. Ships with extensive tests, a fuzz test and a benchmark.