Repository navigation

#

semi-structured-data

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)

Python
307
4 个月前

Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of different layouts in a declarative way.

Kotlin
50
3 个月前

Programming language for symbolic computation with unusual combination of pattern matching features: Tree patterns, associative patterns and expressions embedded in patterns.

C
47
16 天前

A dataset for extracting information from repair manuals

Python
21
5 年前

Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.

Python
18
3 年前

Repository containing code for the NAACL 2021 paper (Incorporating External Knowledge to Enhance Tabular Reasoning)

Python
17
4 年前

Endoscopic and Pathological data extraction for various endo-pathological data extraction

R
13
8 个月前

An ActiveModel extension to model your semi-structured data using embedded associations

Ruby
9
3 年前

Urban Dict spelling variant dataset. Source code of How to Evaluate Word Representations of Informal Domain?

Jupyter Notebook
6
5 年前

This repository contains the official code for the paper : Realistic Data Augmentation Framework for Enhancing Tabular Reasoning (Findings-EMNLP, 2022).

HTML
6
5 个月前

Coherent data analysis library

C++
5
4 个月前

Schema inference for semistructured data using Formal Concept Analysis

Java
5
8 年前

Implementation of the semi-structured inference model in our ACL 2023 paper: INFOSYNC: Information Synchronization across Multilingual Semi-structured Tables.

HTML
3
2 年前

An open collection includes 100+ semi-structured textual datasets. (LOG datasets, TXT datasets, CSV datasets etc.)

PHP
2
3 个月前

Web-based workflow management system that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from the list of candidates. Implemented in Bracmat (~75%) and Java (~25%).

Java
2
1 个月前

Java Standalone application for querying XML documents with requests with preferences (GTPs requests with preferences)

Java
2
5 年前

Framework to manipulate semi structured documents and extract data from them

Java
1
1 个月前

Eloquent Serialized LOB is a trait for Laravel Eloquent models that allows Serialized LOB pattern

PHP
1
2 年前