Towards Mitigation of Hallucination for LLM-Empowered Agents: Progressive Generalization Bound Exploration and Watchdog Monitor

  • Siyuan Liu
  • Wenjing Liu
  • Zhiwei Xu
  • Xin Wang
  • Bo Chen
  • Tao Li
  • Nankai University
  • Haihe Lab of ITAI
  • Inner Mongolia University of Technology
  • CAS - Institute of Computing Technology
  • Michigan Technological University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Empowered by large language models (LLMs), intelligent agents have become a popular paradigm for interacting with open environments to facilitate AI deployment. However, hallucinations generated by LLMs, where outputs are inconsistent with facts, pose a significant challenge and undermine the credibility of intelligent agents. Only if hallucinations can be mitigated can intelligent agents be used in the real world without catastrophic risk. Therefore, effective detection and mitigation of hallucinations are crucial to ensuring the dependability of agents. Unfortunately, existing approaches either depend on white-box access to LLMs or fail to identify hallucinations accurately. To address this challenge, we present HalMit, a novel black-box watchdog framework that models the generalization bound of LLM-empowered agents and thus detects hallucinations without requiring internal knowledge of the LLM's architecture. Specifically, a probabilistic fractal sampling technique is proposed to generate a sufficient number of queries that trigger non-credible responses in parallel, efficiently identifying the generalization bound of the target agent. Experimental evaluations demonstrate that HalMit significantly outperforms existing approaches in hallucination monitoring. Its black-box nature and superior performance make HalMit a promising solution for enhancing the dependability of LLM-powered systems.

Original language: English
Title of host publication: ECAI 2025 - 28th European Conference on Artificial Intelligence, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025 - Proceedings
Editors: Ines Lynce, Nello Murano, Mauro Vallati, Serena Villata, Federico Chesani, Michela Milano, Andrea Omicini, Mehdi Dastani
Publisher: IOS Press BV
Pages: 1019-1026
Number of pages: 8
ISBN (Electronic): 9781643686318
DOIs
State: Published - Oct 21 2025
Event: 28th European Conference on Artificial Intelligence, ECAI 2025, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025 - Bologna, Italy
Duration: Oct 25 2025 → Oct 30 2025

Publication series

Name: Frontiers in Artificial Intelligence and Applications
Volume: 413
ISSN (Print): 0922-6389
ISSN (Electronic): 1879-8314

Conference

Conference: 28th European Conference on Artificial Intelligence, ECAI 2025, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025
Country/Territory: Italy
City: Bologna
Period: 10/25/25 → 10/30/25
