Vol.01 · No.10 CS · AI · Infra May 30, 2026

AI Glossary

GlossaryReferenceLearn
AI Safety & Ethics LLM & Generative AI

Hallucination

Difficulty

Plain Explanation

Hallucination is when an AI gives an answer that sounds fluent but is not supported by the facts, the source document, or the task constraints. The dangerous part is that the answer often sounds confident, so readers may trust it before checking the evidence.

A useful analogy is a presenter filling gaps in their notes. The speech may be smooth, but when you compare it with the slide deck, some details were invented or stitched together from weak memory.

The key idea is not that the model is intentionally lying. It is predicting plausible text under uncertainty. Production systems therefore need grounding, verification, abstention, and review paths rather than only nicer wording.

Examples & Analogies

  • A support chatbot invents a refund policy that is not in the company handbook. The tone is helpful, but the claim is unsupported.
  • A research assistant summarizes a paper and adds a result the paper never reported. This is especially risky when the sentence looks like a citation.
  • A coding assistant suggests a library option that does not exist. Tests or official docs are needed before using it.
  • In healthcare, law, finance, and security workflows, the safest answer may be to refuse, ask for more context, or route to a human reviewer.

At a Glance

TypeWhat goes wrongTypical mitigation
Missing-knowledge hallucinationThe model lacks the needed fact or has stale knowledgeRetrieval, fresh sources, source comparison
Grounding failureSources exist, but the answer drifts away from themSentence-level citation checks, evidence mapping
Reasoning failureAn intermediate step is wrong and the conclusion follows itTests, rule-based verifiers, independent solution paths
Overconfident uncertaintyThe model states an uncertain answer as factConfidence labels, abstention rules, human review

Where and Why It Matters

  • News and research summaries lose credibility if the system adds facts that were not in the source material.
  • Enterprise chatbots can create real operational risk by misstating prices, policies, contracts, or compliance obligations.
  • Developer tools waste time when they invent functions, flags, or package behavior.
  • Agent systems raise the stakes because a hallucinated instruction can trigger a real tool action, such as sending an email or updating a database.
  • Hallucination control is therefore a system design problem: retrieval quality, validation, logging, permissions, and escalation matter as much as the base model.

Common Misconceptions

  • ❌ Myth: Hallucination only happens when the model sounds unsure. → ✅ Reality: the riskiest cases often sound polished and confident.
  • ❌ Myth: Adding RAG eliminates hallucination. → ✅ Reality: retrieval can fail, sources can be stale, and the model can still misread relevant evidence.
  • ❌ Myth: Lower temperature is enough. → ✅ Reality: lower randomness does not fix missing knowledge or unsupported claims.
  • ❌ Myth: Longer answers are more trustworthy. → ✅ Reality: longer answers contain more claims that must be checked.

How It Sounds in Conversation

  • "Which source sentence supports this claim? If none, the answer should abstain."
  • "Do not let the model invent policy details. Compare every policy claim with the handbook."
  • "This looks fluent, but the citation does not actually support the sentence."
  • "For code suggestions, require either a passing test or a link to official docs."
  • "Track hallucination by failure type, not just one aggregate score."

Related Reading

References

Helpful?