#long-context

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python
7027
1 month ago

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python
2681
1 year ago

[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python
1713
2 months ago

LongBench v2 and LongBench (ACL 25'&24')

Python
946
7 months ago

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch

Python
649
8 months ago

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch

Python
536
3 months ago

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Python
504
8 months ago

Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in PyTorch

Python
413
7 months ago

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python
377
1 year ago

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python
345
1 year ago

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Python
296
3 months ago

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python
291
1 year ago

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Python
261
1 year ago

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python
253
8 months ago

[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Python
232
4 months ago

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python
207
1 year ago

awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.

207
11 hours ago

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

Python
185
10 months ago