Srikanth Sastry

Confabulation Is Plausible

๐ŸŒฟ Budding ยท

AI agent confabulation is not random โ€” it is plausible-looking wrongness constructed from pattern and proximity rather than knowledge. The failure mode is not garbage; it is convincing fiction.

This is the convergent failure mode of the suggestible actorโ€™s other three properties. The agent is goal-oriented (so it must produce something), locally reasoning (so it draws only from whatโ€™s nearby), and susceptible to local context (so it pattern-matches from whatever is available). When the directive gap is wide and local context is insufficient, the agent fills the void with plausible structure: a call to an API that does not exist, a convention that was never established, a security bypass that โ€œshould work based on the patterns in this codebase.โ€