AI Reviewing AI: Shared Blind Spots

🌳 Evergreen · Apr 24, 2026

AI models reviewing AI-generated code share systematic blind spots with the generator, creating gaps that neither side detects. Blain & Noiseux (2026) show models detect their own vulnerabilities 78.7% of the time in review mode but generate them 55.8% by default. This suggests AI code review can partially close the review gap. Google (AIware ‘24) and Meta (TestGen-LLM) are investing heavily in this direction.

But if the reviewer model is trained on similar data as the generator, the review itself has systematic gaps. The models do not know what they do not know, and neither does the reviewer. This is the epistemic equivalent of model collapse applied to the review pipeline.

Connections

🌳Guardrail Erosion Is a Meta-Problem
🌳Review Is the Bottleneck
🌳Three Classes of Guardrail Erosion Resistance
🪶The Guardrail Erosion Problem with AI Agents