Repository navigation

#

lm

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python
4144
3 个月前

📃Language Model based sentences scoring library

Python
309
4 年前

Launch monitor using low-cost raspberry pi and camera hardware to determine ball launch speed, angles and spin. See https://discord.gg/vGuyAAxXJH (permalink)

C++
238
1 天前

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]

Python
232
3 个月前

ggplot-based graphics and useful functions for GAMs fitted using the mgcv package

R
228
23 天前

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.

Python
223
17 天前

A unified framework for data analysis with GLM/GLMM in R

R
122
9 个月前

TOEIC(Test of English for International Communication) solving using pytorch-pretrained-BERT model.

Python
122
6 年前

Bangla-Bert is a pretrained bert model for Bengali language

Jupyter Notebook
81
5 个月前

The LM Contamination Index is a manually created database of contamination evidences for LMs.

Python
80
1 年前

LLM面试常见手撕合集

Jupyter Notebook
65
1 个月前

Korean text normalization and language preparation package for LM in Kaldi-based ASR system

Python
62
5 年前

🐍 Python library for n-gram models in ARPA format

Python
40
3 年前

Codes for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"

Jupyter Notebook
36
4 年前

Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language

Python
36
2 年前

The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)

Python
34
1 年前

The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]

Python
32
1 个月前

Automatically extracts NT and LM hashes from Windows memory dumps based on volatility.

Shell
26
2 年前