AI Hallucinations Expose Security Risks in Critical Infrastructure

Based on Artificial Analysis’s AA-Omniscience benchmark, a 2025 evaluation of 40 AI models found that all but four models tested were more likely to provide a confident, incorrect answer than a correct one on difficult questions.

AA‑Omniscience benchmark (2025): a blunt signal

The benchmark finding is blunt: modern language models can sound authoritative while being wrong. The source explains why: base language models do not retrieve verified facts but construct responses by predicting words and phrases from patterns in their training data. When certainty is low, the model has no internal mechanism to declare “I don’t know.” Instead it supplies the statistically most probable response, which can be plausible but factually inaccurate. That dynamic—confidence without a truth test—is the core risk the AA‑Omniscience result highlights.

How hallucinations form: flawed data, bias, validation gaps, and prompts

The article breaks hallucination drivers into four concrete causes. Flawed training data embeds outdated information and outright errors into a model’s outputs; bias in input data can make the model treat overrepresented patterns as universally applicable; basic language models lack response validation because they are optimized for coherence not factual accuracy; and ambiguous prompts give the model room to fill gaps with assumptions. The piece also warns of a future risk: as AI‑generated content proliferates, later models may be trained on fabricated outputs produced by earlier AIs—a phenomenon described in the source as “model collapse.”

Missed threats, fabricated threats, and incorrect remediation

AI hallucinations affect cybersecurity in three distinct, hazardous ways. First, missed threats: models trained on historical patterns can fail to detect underrepresented techniques and zero‑day attacks because those behaviors are not in the training data, leaving environments exposed. Second, fabricated threats: models may invent malicious activity out of normal behavior, producing false positives that trigger unnecessary incident responses, shutdowns and alert fatigue that desensitizes teams. Third, incorrect remediation: once trust is established, an AI may confidently recommend destructive actions—examples given include deleting sensitive files, modifying system configurations or disabling firewall rules. If carried out, especially with privileged accounts, such recommendations can cause system disruption, irreversible data loss or expanded access for attackers.

Controls that reduce risk: human review, data governance, least‑privilege, and prompt training

The article offers several practical controls. Require human review before any AI output can trigger sensitive or privileged actions—this is nonnegotiable for workflows touching infrastructure changes, access updates, or incident response. Treat training data as a security asset: regularly audit and remove outdated, biased or inaccurate records to reduce the chance those flaws reappear in outputs. Enforce least‑privilege access for AI systems so an AI can read but not delete files, limiting what a hallucination can actually do. Invest in prompt engineering training so staff know how to write specific inputs that reduce ambiguity and produce verifiable outputs.

The piece also links identity security to AI governance, arguing that hallucinations become real incidents when systems—or humans with privileged accounts—act on incorrect guidance. It names Keeper® as a product “built to provide organizations with the visibility and access controls needed to prevent unauthorized access,” and frames tools that monitor privileged activity and secure both human and Non‑Human Identities (NHIs) as part of the defensive posture.

What this means for technologists, procurement leaders, and end users

Technologists and security teams: Treat every AI response as a potential vulnerability until a human verifies it; focus on least‑privilege enforcement and auditing the data that grounds AI behavior.
Procurement and enterprise leaders: Require data‑governance clauses and permission controls when acquiring AI systems; plan for continuous auditing to reduce “model collapse” risks as AI‑generated content proliferates.
End users and employees: Expect training in prompt engineering and an operational rule: do not execute sensitive actions based solely on an AI’s recommendation, however confident it sounds.

The AA‑Omniscience result is not an abstract critique; it translates directly into choices organizations must make. Treating AI outputs as unverified by default, shrinking the access an AI has, and rooting governance in data hygiene are concrete steps the source recommends to prevent plausible‑sounding errors from becoming operational calamities. The remaining challenge—how to prevent model collapse as AI content seeds future training sets—was flagged in the piece but not resolved here: will organizations build the sustained data governance and identity controls necessary to stop confident lies from becoming dangerous actions?

Original story