Repository navigation

#

data-filtering

Official Repository of "LLM × DATA" Survey Paper

414
1 个月前

DSIR large-scale data selection framework for language model training

Python
258
1 年前

⏳ Provide filtering, sanitizing, and conversion of Golang data. 提供对Golang数据的过滤,净化,转换。

Go
151
1 天前

A GraphQL like interface to map a request to eloquent query with data transformation for Laravel.

PHP
79
8 年前
C++
65
1 年前

Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".

Python
65
4 个月前

一个用于模块化管理前端请求的工具

JavaScript
40
3 年前

[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Models

Python
36
14 天前

R Tutorial: useful R codes for cleaning and filtering data from Qualtrics surveys, and for creating new variables in the dataframe. With step-by-step explanations.

R
17
3 年前

This repository contains all (Python 3) code and libraries required for the 2022-2023 Notre Dame Rocketry Team (NDRT) Apogee Control System (ACS). It also contains sensor/actuator example code and flight data.

Python
11
2 年前

EpiMethEx (Epigenetic Methylation and Expression), a R package to perform a large-scale integrated analysis by cyclic correlation analyses between methylation and gene expression data.

R
8
7 年前

Data extraction from smartphones and GPS and Accelerometer data "fusion" with Kalman filter.

Java
6
3 年前

Base-call error-filtering and read preprocessing pipeline for fastq libraries

Python
4
4 年前

Anonymises data inside text files and in sheet files. It recognises and removes various sorts of personally identifiable information (PII). Each removed part is replaced with a suitable generic text, depending on the type of removed data. Currently English and Russian languages are supported. Russian works both with Cyrillic and Latin characters.

Python
3
1 年前

CDC Connect is a cross-platform mobile application built in React Native using JavaScript. The app is designed for data collection with a focus on surveys.

JavaScript
3
1 年前

Make the data grid's Auto Filter Row insensitive to accents.

Visual Basic .NET
2
2 个月前