lm
General technology for enabling AI capabilities w/ LLMs and MLLMs
📃 Language Model based sentence scoring library
Launch monitor using low-cost raspberry pi and camera hardware to determine ball launch speed, angles and spin. See https://discord.gg/vGuyAAxXJH (permalink)
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
Bangla-Bert is a pretrained BERT model for the Bengali language
The LM Contamination Index is a manually created database of contamination evidence for LMs.
Korean text normalization and language preparation package for LMs in Kaldi-based ASR systems
🐍 Python library for n-gram models in ARPA format
Code for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks; an initial version of the library, focused on the Polish language
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]
Automatically extracts NT and LM hashes from Windows memory dumps using Volatility.