faithfulness

Here are 30 public repositories matching this topic...

pkuserc / ChatGPT_for_IE

Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

performance evaluation information-extraction calibration named-entity-recognition event-detection event-extraction relation-extraction entity-typing relation-classification explainability large-language-models chatgpt faithfulness

Updated Aug 17, 2024
Python

MinhVuong2000 / LLMReasonCert

Star

[ACL'24] Official Implementation of the paper "Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs"(https://aclanthology.org/2024.findings-acl.168)

framework evaluation knowledge-graph reasoning evaluation-framework llms faithfulness

Updated Apr 22, 2025
Python

bcdnlp / FAITHSCORE

Star

FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models

evaluation-metrics hallucination faithfulness multimodal-large-language-models

Updated Nov 27, 2025
Python

LCM-Lab / L-CITEEVAL

Star

Evaluating the faithfulness of long-context language models

benchmark faithfulness long-context-evaluation

Updated Oct 21, 2024
Python

khuangaf / CHOCOLATE

Star

Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"

factuality faithfulness large-vision-language-models chart-understanding chart-captioning chart-summarization

Updated Jun 5, 2024
Jupyter Notebook

YisongMiao / DiSQ-Score

Star

The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations> @ ACL 2024

evaluation discourse language-model faithfulness socratic-method

Updated Aug 7, 2024
Python

About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning" . Do not hesitate to open an issue if you run into any trouble!

nlp reasoning faithfulness chain-of-thought-reasoning

Updated Jan 14, 2026
Python

KomeijiForce / Active_Passive_Constraint_Koishiday_2024

Star

Koishi's Day 2024 Paper (NeurIPS 2024): An advanced persona-driven role-playing system with global faithfulness quantification and optimization. In memory of the Koishi's Day of 2024.

role-playing metrics global-optimization quantification factuality-checking faithfulness komeiji ai-character

Updated Oct 19, 2025
Python

rodolfboctor / rag-eval-toolkit

Star

Open-source Python toolkit for evaluating RAG pipelines. LLM-as-judge for faithfulness, relevancy, and context precision with Claude and GPT-4 backends.

python nlp evaluation claude rag gpt-4 llm faithfulness

Updated Mar 28, 2026
Python

SupermatAI / supermat

Star

Novel data representation leading to granular citations and higher accuracy

python data machine-learning ai vector context embeddings citations accuracy chunking rouge-metric traceability unstructured-data citations-data datalabeling hallucinations llm faithfulness graphrag

Updated Feb 18, 2025
Python

Trustworthy-ML-Lab / Training_Trustworthy_LRM_with_Refine

Star

A new training framework for Trustworthy Large Reasoning Models

machine-learning deep-learning interpretability trustworthy-ai llms faithfulness llms-reasoning

Updated Oct 31, 2025
Python

vggls / medical_xai

Star

On the evaluation of deep learning interpretability methods for medical images under the scope of faithfulness

computer-vision grad-cam haas x-ray digital-pathology explainable-ai medical-ai aopc hirescam faithfulness max-sensitivity

Updated Jan 14, 2025
Jupyter Notebook

luka-group / Causal-View-of-Entity-Bias

Star

[EMNLP 2023] A Causal View of Entity Bias in (Large) Language Models

debiasing large-language-models faithfulness causal-intervention knowledge-conflicts

Updated Mar 21, 2024
Python

du-nlp-lab / FIFA

Star

FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation

metrics video-generation hallucination faithfulness multimodal-large-language-models evluation

Updated Jul 27, 2025
Python

Bar-A-94 / FaithfulnessSerum

Star

Official PyTorch implementation of Faithfulness Serum (ACL Main 2026) - a training-free method that improves the faithfulness of LLM explanations by guiding generation with attribution-based signals.

nlp attribution interpretability explainability llm faithfulness acl-2026

Updated Apr 18, 2026
Python

narutatsuri / Unbiased-Perspective-Summarization

Star

[ACL 2025] Reranking-based Generation for Unbiased Perspective Summarization

machine-learning evaluation summarization prompting rlhf faithfulness

Updated Jun 23, 2025
Python

sinaabbasi1 / NormXLogit

Star

The official repo for the EMNLP 2025 paper "NormXLogit: The Head-on-Top Never Lies"

nlp transformers interpretability explainability plausibility llm faithfulness emnlp2025

Updated Nov 5, 2025
Jupyter Notebook

obielin / rag-eval-kit

Star

End-to-end RAG evaluation kit. Auto-generate test questions from your corpus, score responses on faithfulness/relevance/completeness with LLM-as-judge, produce quality reports. Works with any RAG implementation.

python evaluation quality-metrics rag llm anthropic faithfulness retrieval-augmented-generation llm-evaluation hallucination-detection

Updated Apr 10, 2026
Python

Maguids / XAI-Techniques--Students-Dropout-Dataset

Star

This project applies Explainable AI techniques to a Student Dropout dataset, covering pre-, in- and post-modeling explanations, as well as an analysis of their quality. The project was developed for the "Adavnced Topics on Machine Learning" course. 1st Semester of the 1st Year of the Master's Degree in Artificial Intelligence.

prototype anchors xgboost pca mlp tsne decision-tree lime xai surrogate-models criticism shap pfi faithfulness

Updated Dec 16, 2025
Jupyter Notebook

tarun-ks / MicroGuard

Star

nlp evaluation lora rag edge-ai faithfulness small-language-models hallucination-detection

Updated Mar 24, 2026
Jupyter Notebook

Improve this page

Add a description, image, and links to the faithfulness topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the faithfulness topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

faithfulness

Here are 30 public repositories matching this topic...

pkuserc / ChatGPT_for_IE

MinhVuong2000 / LLMReasonCert

bcdnlp / FAITHSCORE

LCM-Lab / L-CITEEVAL

khuangaf / CHOCOLATE

YisongMiao / DiSQ-Score

debjitpaul / Causal_CoT

KomeijiForce / Active_Passive_Constraint_Koishiday_2024

rodolfboctor / rag-eval-toolkit

SupermatAI / supermat

Trustworthy-ML-Lab / Training_Trustworthy_LRM_with_Refine

vggls / medical_xai

luka-group / Causal-View-of-Entity-Bias

du-nlp-lab / FIFA

Bar-A-94 / FaithfulnessSerum

narutatsuri / Unbiased-Perspective-Summarization

sinaabbasi1 / NormXLogit

obielin / rag-eval-kit

Maguids / XAI-Techniques--Students-Dropout-Dataset

tarun-ks / MicroGuard

Improve this page

Add this topic to your repo