# factuality

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python
1500
4 months ago

Loki: Open-source solution designed to automate the process of verifying factuality

Python
1103
1 year ago

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Python
638
14 days ago

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python
506
7 months ago

RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

Python
383
3 months ago

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python
373
4 months ago

[Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers

Python
132
1 year ago

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

Python
131
1 year ago

This AI fact-checking system, built with LangGraph, dissects text into verifiable claims, cross-referencing them with real-world evidence via web searches. It then generates detailed accuracy reports, ideal for combating misinformation in LLM outputs, news, or any text.

TypeScript
59
18 days ago

Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"

Jupyter Notebook
59
7 months ago

Implementation of the paper "FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (NAACL 2022)"

Python
50
2 years ago

OLAPH: Improving Factuality in Biomedical Long-form Question Answering

Python
39
1 year ago

Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation

Python
35
1 year ago

[EMNLP 2023] Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators

Python
32
2 years ago

SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models https://arxiv.org/pdf/2411.02433

Python
28
8 months ago

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Python
26
1 month ago

Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"

Jupyter Notebook
26
1 year ago

Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"

Jupyter Notebook
19
1 year ago

Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"

Python
15
9 months ago

Code and data for the Dreyer et al. (2023) paper on abstractiveness and factuality in abstractive summarization

Python
12
2 years ago