Repository navigation

#

hallucinations

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python
2676
5 天前

List of papers on hallucination detection in LLMs.

936
2 个月前

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python
639
8 个月前

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python
506
7 个月前

Protect your AI agents and GenAI apps with confidence

Python
357
19 天前

Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.

HTML
217
12 天前
Python
172
2 个月前

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Python
146
1 年前

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Python
132
1 年前

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

Python
130
1 年前

Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.

113
22 天前

mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating

Python
96
2 年前

[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"

Python
93
9 个月前

Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"

47
2 年前

Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency

Jupyter Notebook
35
7 个月前

[EMNLP 2023] Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators

Python
32
2 年前