How do you detect and test for hallucination in an LLM feature?

Question

Accepted Answer

Provide inputs with known ground truth and check whether the model's output contradicts that truth or fabricates beyond it. For RAG features, groundedness checks verify that every factual claim appears in the retrieved context. For closed-domain tasks, a reference set with correct answers enables accuracy scoring.

Hallucination, meaning the model generates plausible but false information, is among the hardest AI failure modes to test because you need a ground truth to compare against.

RAG / document Q&A: provide a context document and a question whose correct answer exists in the document. Test: does the response accurately reflect only what is in the document? A secondary LLM or a keyword check verifies grounding: "does claim X appear in context Y, either verbatim or in paraphrase?"

Closed-domain knowledge (medical, legal, technical): maintain a golden set of factual questions with verified answers. Score the model's answers for factual accuracy and flag responses that contradict the reference.

Open-domain (summaris
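To make the two checks concrete, here is a minimal sketch in Python, assuming a crude word-overlap heuristic for grounding and normalized exact match for the golden set. The function names (`ungrounded_claims`, `golden_set_accuracy`), the stopword list, and the 0.8 threshold are all illustrative assumptions, not part of any library. A real suite would likely swap the overlap heuristic for an entailment model or an LLM judge, since word overlap misses paraphrase and negation.

```python
import re

# Hypothetical, deliberately tiny stopword list; tune for your domain.
STOPWORDS = {"the", "a", "an", "is", "are", "was", "of", "in", "to", "and", "it", "on", "for"}


def content_words(text: str) -> set[str]:
    """Lowercase alphanumeric tokens with stopwords removed."""
    return {w for w in re.findall(r"[a-z0-9]+", text.lower()) if w not in STOPWORDS}


def split_claims(response: str) -> list[str]:
    """Naive claim splitter: treat each sentence as one claim."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", response) if s.strip()]


def ungrounded_claims(response: str, context: str, min_overlap: float = 0.8) -> list[str]:
    """Return claims whose content words are mostly absent from the context.

    Assumption: word overlap is only a cheap proxy for grounding.
    """
    ctx = content_words(context)
    flagged = []
    for claim in split_claims(response):
        words = content_words(claim)
        if not words:
            continue  # nothing checkable in this sentence
        overlap = len(words & ctx) / len(words)
        if overlap < min_overlap:
            flagged.append(claim)
    return flagged


def normalize(text: str) -> str:
    """Case- and punctuation-insensitive form used for exact-match scoring."""
    return " ".join(re.findall(r"[a-z0-9]+", text.lower()))


def golden_set_accuracy(model_answers: dict[str, str], golden: dict[str, str]) -> float:
    """Fraction of golden questions whose answer matches the verified reference."""
    if not golden:
        raise ValueError("golden set is empty")
    correct = sum(
        1 for q, ref in golden.items()
        if normalize(model_answers.get(q, "")) == normalize(ref)
    )
    return correct / len(golden)


if __name__ == "__main__":
    context = "The refund window is 30 days. Refunds are issued to the original payment method."
    response = "The refund window is 30 days. Customers also receive a 10% loyalty bonus."
    print(ungrounded_claims(response, context))
    print(golden_set_accuracy({"q1": "30 days.", "q2": "London"}, {"q1": "30 days", "q2": "Paris"}))
```

In a test suite, the RAG case would assert that `ungrounded_claims` returns an empty list for each (context, question) fixture, and the closed-domain case would assert that `golden_set_accuracy` stays above a threshold agreed for the feature.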

// WHAT INTERVIEWERS LOOK FOR