#166 Probability & Logic Metacognitive error

Illusion of Validity

aka Illusion of Skill

Overestimating the accuracy of your predictions when data appears to form a coherent story, even when evidence says otherwise.

WHAT IT IS

The glitch, explained plainly.

Imagine you're watching clouds and you see one that looks exactly like a dragon. You become really sure it IS a dragon-shaped cloud and start telling everyone. But the cloud just happened to look that way — your brain made a story out of random shapes, and now you're super confident about something that was really just a coincidence. That's what happens when grown-ups look at information and see a clear pattern: they get really sure about what will happen next, even when their guesses turn out wrong over and over.

The illusion of validity occurs when individuals feel highly confident in their predictions or assessments because the available information appears internally consistent or tells a compelling story, regardless of whether that consistency actually improves predictive accuracy. Critically, this confidence persists even after receiving clear feedback that prior predictions were inaccurate, making it one of the most resistant biases to correction. The bias is amplified when input data is highly redundant or correlated — many similar data points feel like stronger evidence, even though truly independent data points would be far more informative. This leads professionals in fields from finance to clinical psychology to maintain unshakeable faith in their expert judgment despite track records that statistically resemble chance.

SOUND FAMILIAR?

Where it shows up.

01 A venture capitalist reads a startup's pitch deck that weaves together impressive founder backgrounds, growing market size data, and a compelling product narrative into a tidy story. She rates the startup as a 'sure bet' and invests heavily, even though her firm's historical data shows that pitch deck quality has near-zero correlation with startup success.
02 A school principal reviews a transfer student's complete file — glowing teacher comments, consistent A grades, and strong extracurricular involvement — and confidently tells the parents the student will absolutely thrive at the new school. He ignores the fact that past transfers with similar profiles have had wildly variable outcomes due to social adjustment factors invisible in the file.
03 An HR manager conducts a structured behavioral interview and the candidate's answers all align perfectly with the ideal profile. She rates the candidate 9/10 for predicted performance, even though internal audits have shown her interview scores correlate only weakly with actual one-year performance reviews. When shown this data, she acknowledges it but still feels confident about this particular candidate.
04 A political analyst assembles polling data, economic indicators, and historical voting trends that all point toward the same electoral outcome. He publishes a prediction with 95% confidence. His colleague points out that the same indicators gave similarly coherent signals before the last two elections and were wrong both times. He concedes the point intellectually but doesn't lower his confidence rating.
05 A senior physician reviews a patient's symptoms, lab results, and family history, all of which form a textbook picture of a particular rare condition. She diagnoses it immediately with high confidence. A junior colleague suggests factoring in the condition's extremely low base rate in the general population, which would make the diagnosis far less probable despite the coherent clinical picture. The senior physician dismisses this, saying the presentation is too clear to doubt.

IN DIFFERENT DOMAINS

Where it shows up at work.

The same glitch looks different depending on the terrain. Finance, medicine, a relationship, a team — same mechanism, different costume.

Finance & investing

Fund managers and stock analysts construct elaborate narratives from market data, developing strong confidence in their predictions. Studies show that most active fund managers perform no better than chance over extended periods, yet both they and their firms maintain conviction in the skill behind their predictions. Redundant financial indicators that move together inflate subjective confidence without improving actual forecast accuracy.

Medicine & diagnosis

Clinicians often develop high diagnostic confidence when a patient's symptoms, history, and test results form a coherent clinical picture. This confidence may persist even when the condition's base rate is extremely low, leading to overdiagnosis. Additionally, more clinical information tends to increase confidence without proportionally increasing diagnostic accuracy.

Education & grading

Teachers and admissions officers develop strong predictions about student success based on application materials, grades, and interviews that form a consistent narrative. Research shows that standardized test scores and interview impressions are weaker predictors of academic success than typically assumed, yet the coherent story these data points tell sustains high confidence in selection decisions.

Relationships

People form strong first impressions of potential partners based on consistent signals — attractive appearance, shared interests, charming conversation — and become overly confident that the relationship will succeed. The coherent narrative from early interactions creates a sense of certainty that obscures the many unpredictable factors determining long-term compatibility.

Tech & product

Product teams review user research data that forms a clean narrative about user needs and become overconfident in their design decisions. A/B tests frequently disprove confident predictions based on qualitative research, yet teams continue to trust their interpretive frameworks. ML engineers also exhibit this when clean-looking training metrics create overconfidence in model generalization to production data.

Workplace & hiring

Hiring managers conduct interviews and feel certain about a candidate because their answers, resume, and demeanor all tell a consistent story. Research consistently shows that unstructured interviews add little predictive validity beyond basic qualifications, yet interviewers maintain high confidence in their ability to identify top performers from the interview narrative.

Politics Media

Pundits assemble coherent narratives from polls, economic data, and political dynamics, generating confident predictions. When predictions fail, the same analysts quickly construct equally coherent post-hoc explanations. Media consumers trust these analysts because their reasoning sounds compelling, regardless of their actual prediction track record.

HOW TO SPOT IT

Ask yourself…

Am I feeling confident because the data tells a coherent story, or because this type of data has independently proven to be predictive?
Would I still be this confident if I reviewed my actual track record of past predictions in similar situations?
Am I relying on many correlated pieces of information that essentially all say the same thing, or on genuinely independent lines of evidence?

HOW TO DEFEND AGAINST IT

The playbook.

Keep a written prediction log with confidence levels and systematically review actual outcomes to calibrate your accuracy over time.
Ask: 'Is the environment I'm predicting in sufficiently regular and predictable, or is it fundamentally noisy and random?'
Count how many truly independent data points support your prediction versus how many are restatements of the same underlying signal.
Before finalizing a judgment, conduct a pre-mortem: assume your prediction is wrong and generate three plausible reasons why it could fail.
Replace subjective confidence with base rates: what percentage of similar predictions in this domain actually turn out to be correct?

FAMOUS CASES

In history.

Kahneman's experience evaluating Israeli Army officer candidates using leaderless group challenges: despite data showing their predictions had near-zero validity, evaluators remained confident in each new assessment.
Freeman Dyson's account of British Bomber Command in WWII: commanders maintained confidence that gun turrets protected bombers despite statistical evidence showing turrets were ineffective and slowed the aircraft.
The 2008 financial crisis: credit rating agencies and financial institutions maintained high confidence in mortgage-backed securities based on coherent-looking risk models built from highly correlated inputs that all failed simultaneously.

WHERE IT COMES FROM

Academic origin

Amos Tversky and Daniel Kahneman, 1973, in their paper 'On the Psychology of Prediction' published in Psychological Review. Further developed by Einhorn and Hogarth in 1978.

Evolutionary origin

In ancestral environments, rapidly recognizing patterns — such as predator tracks, seasonal changes, or social cues — provided critical survival advantages. The cost of missing a real pattern was far greater than the cost of seeing a pattern that wasn't there. Natural selection thus favored brains that formed confident assessments quickly from limited, correlated information, even at the expense of statistical rigor.

IN AI SYSTEMS

How the machines inherit it.

ML models exhibit a computational analog of the illusion of validity: they produce highly confident predictions on data that superficially matches training patterns, even when those patterns don't generalize out-of-distribution. LLMs after alignment become systematically overconfident because human raters in RLHF reward confident-sounding answers over appropriately hedged ones. Developers fall prey to the illusion when clean training metrics and coherent model outputs create false confidence in real-world performance. Users then overtrust these authoritative-sounding outputs, replicating the human bias at the system level.

10 glitches quietly running your life.

A free field-zine PDF — ten cognitive glitches named, illustrated, with a defense move for each. Plus the weekly Glitch Report on Fridays — one bias named, two spotted in the wild, one defense move. Unsubscribe any time.

EXPLORE MORE

Related glitches.

#55

Overconfidence Effect

Ego & Self-Preservation

#89

Representativeness Heuristic

Illusion of Competence

LAUNCH PRICE

Train against your blindspots.

50 cards are free to preview. Buyers unlock the rest of the deck plus the interactive training — Spot-the-Bias Quiz unlimited, Swipe Deck with spaced repetition, My Blindspots, Decision Pre-Flight, the Printable Deck + Cheat Sheets, and the Field Guide e-book. $29.50$59.