
#hallucination

Loki: Open-source solution designed to automate the process of verifying factuality

Python
1064
7 months ago

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python
634
4 months ago

RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

Python
361
5 months ago

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python
280
5 months ago

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

Jupyter Notebook
166
4 months ago
Python
162
5 months ago

😎 curated list of awesome LMM hallucinations papers, methods & resources.

150
1 year ago

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Python
147
1 year ago
Python
141
2 months ago

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

120
14 days ago

[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection

Python
87
1 year ago

This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.

Python
78
2 months ago

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Python
63
1 year ago

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Python
57
4 months ago

Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without a custom rubric or reference answer, using absolute or relative grading, and much more. It also contains a list of available tools, methods, repos, and code for hallucination detection, LLM evaluation, grading, and more.

Jupyter Notebook
49
9 months ago