Confabulation Is Plausible

🌳 Evergreen · Apr 24, 2026

AI agent confabulation is not random — it is plausible-looking wrongness constructed from pattern and proximity rather than knowledge. The failure mode is not garbage; it is convincing fiction.

This is the convergent failure mode of the suggestible actor’s other three properties. The agent is goal-oriented (so it must produce something), locally reasoning (so it draws only from what’s nearby), and susceptible to local context (so it pattern-matches from whatever is available). When the directive gap is wide and local context is insufficient, the agent fills the void with plausible structure: a call to an API that does not exist, a convention that was never established, a security bypass that “should work based on the patterns in this codebase.”

Connections

🌳Convert Ambient Knowledge into Local Context
🌳Susceptibility Peaks at Failure
🌳The Directive Gap
🌳The Suggestible Actor: Four Properties
🪶The Guardrail Erosion Problem with AI Agents
🪶The Suggestible Actor: A New Model for AI-Assisted Software Development