Repository navigation

#

hallucinations

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python
2130
3 天前

List of papers on hallucination detection in LLMs.

839
13 天前

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python
634
4 个月前

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python
480
3 个月前
Python
318
2 个月前
Python
162
5 个月前

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Python
147
1 年前

Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.

HTML
124
2 天前

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Python
124
10 个月前

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

Python
120
8 个月前

Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.

107
7 个月前

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating

Python
94
1 年前

[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"

Python
86
5 个月前

Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"

47
1 年前

Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency

Jupyter Notebook
35
3 个月前

The implementation for EMNLP 2023 paper ”Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators“

Python
30
1 年前