Source attribution

From llmref.wiki
(Redirected from AI citation)
Source attribution — The mechanism by which an AI system links the claims in its answer to the underlying sources.

Overview

Source attribution (also AI citation) is the mechanism by which an AI system associates statements in a generated answer with the sources that support them, typically presented as inline links or numbered references. Attribution is the basis of citation rate and a prerequisite for users to verify AI answers.

Source attribution is widely reported to be unreliable: studies and audits have found high rates of citations that do not support the associated claim, or that point to non-existent sources, with reported error rates spanning a wide range depending on system and methodology.[1][2] Because the concept itself is contested and unstandardized, "attribution" can mean anything from a relevant link to a verified claim-to-source mapping.

How it works

Attribution is produced in different ways depending on architecture:

  • Retrieval-grounded systems attach citations to the documents retrieved for the answer; accuracy depends on whether the cited document actually supports the generated claim (see Faithfulness vs Groundedness).
  • Post-hoc attribution generates citations after the answer, which can produce plausible but unsupported references.

Evaluating attribution requires checking, per claim, whether the cited source exists and supports the statement — distinct from merely counting that a citation is present.

Distinction from related terms

Term Refers to
Source attribution Linking answer claims to underlying sources
Grounding Anchoring an answer in sources at generation time
Groundedness Whether each claim is supported by the provided context
Hallucinated citation An attribution to a source that does not exist or does not support the claim

Source attribution is not the same as correctness: a present, well-formatted citation may still fail to support the claim it accompanies.

Examples

  • An answer footnotes a study URL that, when opened, does not contain the cited statistic — a citation present but unsupported.
  • A model invents a plausible-looking paper title and author that do not exist — a hallucinated citation.

See also

References

  1. LLM Pulse. "Source Attribution in AI: what it is and how to improve it." https://llmpulse.ai/blog/glossary/source-attribution-in-ai/
  2. Research on fabricated references in LLM outputs documents large-scale generation of non-existent citations. See e.g. studies of hallucinated citations in model outputs.