Repository navigation

hallucination

Website
Wikipedia

Libr-AI / OpenFactVerification

Loki: Open-source solution designed to automate the process of verifying factuality

人工智能 factuality hallucination

Python

1103

1 年前

cvs-health / uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection

ai-safety hallucination 大语言模型 llm-evaluation uncertainty-estimation uncertainty-quantification

Python

855

5 天前

jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

Awesome Lists calibration gpt-3 gpt-4 大语言模型 reliability robustness safety uncertainty-estimation uncertainty-quantification ChatGPT prompt-engineering prompting chain-of-thought in-context-learning large-language-models hallucination

776

3 个月前

onestardao / WFGY

WFGY 2.0 — Semantic Reasoning Engine for LLMs (MIT). Fixes RAG/OCR drift, collapse & “ghost matches” via symbolic overlays + logic patches. Autoboot; OneLine & Flagship. ⭐ Star if you explore semantic RAG or hallucination mitigation.

alignment embedding knowledge-graph 大语言模型 Open Source reasoning semantic-engine transformer hallucination rag

Python

659

1 天前

VITA-MLLM / Woodpecker

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

hallucination hallucinations large-language-models 大语言模型 mllm multimodal-large-language-models multimodality

Python

639

8 个月前

amazon-science / RefChecker

RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

factuality hallucination 大语言模型

Python

383

3 个月前

MigoXLab / dingo

Dingo: A Comprehensive AI Data Quality Evaluation Tool

data-quality 数据科学 data-validation gpt 大语言模型 Apache Spark vlm dataquality datascience openai deepseek qwen hallucination

JavaScript

352

9 天前

tianyi-lab / HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

benchmark vlms gpt-4 gpt-4v llava benchmarks hallucination 大语言模型 lmm large-language-models large-vision-language-models

Python

297

9 个月前

FuxiaoLiu / LRV-Instruction

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

evaluation gpt-4 hallucination object-detection vision vqa llama vicuna llava gpt multimodal prompt-engineering ChatGPT evaluation-metrics foundation-models vision-and-language iclr iclr2024

Python

285

1 年前

shufangxun / LLaVA-MoD

[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

mixture-of-experts multimodal-large-language-models knowledge-distillation hallucination rlhf llava qwen 大语言模型 mllm moe large-language-models distillation

Python

191

5 个月前

IAAR-Shanghai / UHGEval

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

benchmark dataset evaluation 大语言模型 ChatGPT gpt-3 gpt-4 hallucinations large-language-models qwen hallucination huggingface huggingface-transformers openai openai-api ceval

Python

172

2 个月前

IAAR-Shanghai / ICSFSurvey

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

大语言模型 large-language-models self-improvement chain-of-thought hallucination reasoning data-augmentation decoding knowledge-distillation

Jupyter Notebook

169

8 个月前

NishilBalar / Awesome-LVLM-Hallucination

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

hallucination large-vision-language-models multimodal-large-language-models large-language-models 大语言模型 mllm

156

24 天前

AmourWaltz / Reliable-LLM

hallucination knowledge reliable uncertainty

JavaScript

154

1 年前

zjunlp / KnowledgeCircuits

[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

人工智能 interpretability large-language-models 自然语言处理 circuit hallucination transformer

Python

153

6 个月前

xieyuquanxx / awesome-Large-MultiModal-Hallucination

😎 curated list of awesome LMM hallucinations papers, methods & resources.

hallucination multi-modal lmm multimodal

149

1 年前

ictnlp / TruthX

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

hallucinations language-model 大语言模型 llm-inference baichuan chatglm ChatGPT gpt-4 hallucination llama llama2 mistral safety representation explainable-ai llama3

Python

146

1 年前

zjunlp / Deco

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

人工智能 decoding hallucination large-language-models mllm multimodal-large-language-models 自然语言处理 iclr2025

Python

8 个月前

zjunlp / FactCHD

[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection

large-language-models hallucination knowledge 自然语言处理 benchmark dataset

Python

1 年前

yfzhang114 / LLaVA-Align

[ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.

hallucination large-vision-language-models

Python

6 个月前