Base Rate Fallacy

aka Base Rate Neglect · Base Rate Bias · False Positive Paradox

Ignoring how common something actually is and instead basing probability estimates on vivid, specific details.

WHAT IT IS

The glitch, explained plainly.

Imagine there are 100 dogs in a park — 95 are friendly and 5 are mean. A dog walks up to you wearing a spiked collar and growling a little. Your brain screams 'Mean dog!' because of the scary details. But if you remembered that 95 out of 100 dogs here are friendly, you'd realize it's still way more likely the dog is just a friendly one having a bad moment. The Base Rate Fallacy is when you forget about the 95-out-of-100 part because the spiked collar story is more interesting.

The Base Rate Fallacy occurs when individuals presented with both general statistical information (how prevalent something is in a population) and specific individuating information (details about a particular case) systematically overweight the specific details and underweight or entirely ignore the base rate. This leads to probability estimates that violate Bayes' theorem, the normative framework for updating beliefs given new evidence. The fallacy is especially pernicious in diagnostic contexts — medical testing, criminal forensics, and hiring algorithms — where a low base rate of the target condition means that even highly accurate tests produce far more false positives than true positives. The bias is driven by the brain's preference for narrative and concrete detail over abstract statistical information, making vivid descriptions feel more informative than they actually are.
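
For reference, Bayes' theorem can be written as below. The symbols are assumptions introduced here for illustration: H is the hypothesis (for example, "this person has the condition"), E is the specific evidence, and P(H) is the base rate.

```latex
P(H \mid E) = \frac{P(E \mid H)\,P(H)}{P(E \mid H)\,P(H) + P(E \mid \neg H)\,P(\neg H)}
```

When P(H) is small, the second term in the denominator dominates unless the evidence almost never occurs when H is false, which is why leaving P(H) out of the estimate inflates it.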

SOUND FAMILIAR?

Where it shows up.

  1. After hearing about a shark attack on the news, refusing to swim in the ocean even though the statistical risk is far lower than that of the car ride to the beach.
  2. Assuming a quiet, bookish coworker must have studied literature in college, ignoring that business and engineering majors vastly outnumber literature students.

IN DIFFERENT DOMAINS

Where it shows up at work.

The same glitch looks different depending on the terrain. Finance, medicine, a relationship, a team — same mechanism, different costume.

Finance & investing

Investors overweight a company's vivid narrative — charismatic CEO, flashy product launch — while ignoring the base rate of startup failure or industry-wide default rates, leading to overvaluation of individual stocks and underestimation of portfolio risk.

Medicine & diagnosis

Physicians and patients routinely misinterpret positive screening results for rare diseases. For a condition affecting 1 in 1,000 people, a test described as 95% accurate (taken here to mean 95% sensitivity and 95% specificity) yields roughly 50 false positives for every true positive, yet both doctors and patients frequently assume a positive result means a near-certain diagnosis.
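
The screening arithmetic can be checked with a short sketch. It assumes that "95% accurate" means 95% sensitivity and 95% specificity, which the text does not state outright; the function name and parameters are hypothetical, chosen for illustration.

```python
def positive_predictive_value(prevalence, sensitivity, specificity):
    """Return P(condition | positive test) using Bayes' theorem."""
    # Fraction of the whole population that is sick AND tests positive.
    true_pos = sensitivity * prevalence
    # Fraction of the whole population that is healthy AND tests positive.
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


# Assumed reading of the example: 1 in 1,000 prevalence,
# 95% sensitivity, 95% specificity.
ppv = positive_predictive_value(prevalence=1 / 1000,
                                sensitivity=0.95,
                                specificity=0.95)
print(f"P(disease | positive) = {ppv:.1%}")  # prints: P(disease | positive) = 1.9%
```

Under these assumptions a positive result leaves the patient with roughly a 2% chance of having the condition, not 95%.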

HOW TO SPOT IT

Ask yourself…

  • Am I being swayed by a vivid description or compelling detail while forgetting how common or rare this thing actually is in the population?
  • Do I know the base rate — the overall prevalence or frequency — of what I'm trying to estimate, or am I only considering the specific evidence in front of me?
HOW TO DEFEND AGAINST IT

The playbook.

  • Always ask 'What is the base rate?' before evaluating any specific evidence — make it the first question, not an afterthought.
  • Convert probability formats into natural frequencies: instead of '95% accurate test,' think 'of 1,000 people tested, 1 has the disease, and about 50 of the 999 healthy people test positive anyway.'
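
The natural-frequency conversion in the playbook can be sketched as a small helper. This is a minimal illustration, assuming 95% sensitivity and 95% specificity for the "95% accurate" test; the function and key names are hypothetical.

```python
def natural_frequencies(population, prevalence, sensitivity, specificity):
    """Turn rates into whole-person counts for a tested population."""
    with_condition = round(population * prevalence)
    without_condition = population - with_condition
    return {
        "with_condition": with_condition,
        "without_condition": without_condition,
        # Sick people the test correctly flags.
        "true_positives": round(with_condition * sensitivity),
        # Healthy people the test wrongly flags.
        "false_positives": round(without_condition * (1 - specificity)),
    }


counts = natural_frequencies(population=1000, prevalence=0.001,
                             sensitivity=0.95, specificity=0.95)
total_positives = counts["true_positives"] + counts["false_positives"]
print(f"{counts['true_positives']} of {total_positives} positive results is a real case")
```

Counts are rounded to whole people, so the result is an approximation of the exact probability, but the picture of 1 real case among about 51 positives is the point of the exercise.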
FAMOUS CASES

In history.

  • The false positive paradox in post-9/11 mass surveillance programs: analysts estimated that data-mining algorithms would generate tens of thousands to billions of false positives for every true terrorist identified, because the base rate of terrorism is extraordinarily low.
  • The O.J. Simpson trial (1995): defense attorney Alan Dershowitz argued on television that only 0.1% of men who batter their partners go on to murder them, using base rate reasoning to counter prosecution claims. Critics noted that this was itself a misapplication of conditional probability: the relevant question is how often a battered woman who has been murdered was killed by her batterer, and that probability is far higher.
  • David Eddy's 1982 study found that fewer than 5% of physicians correctly estimated the probability of breast cancer given a positive mammogram, with most confusing the test's sensitivity with the posterior probability — a textbook case of base rate neglect in clinical medicine.
WHERE IT COMES FROM
Academic origin

Daniel Kahneman and Amos Tversky, 1973 ('On the Psychology of Prediction,' Psychological Review). Maya Bar-Hillel formalized the term 'base rate fallacy' in her influential 1980 paper in Acta Psychologica.

Evolutionary origin

One evolutionary account holds that in ancestral environments, organisms acquired probabilistic information sequentially through direct experience (natural sampling) rather than as abstract percentages. A predator encounter was processed as a vivid, immediately relevant event, not as a fraction of total observations. On this view, brains evolved to prioritize specific, salient environmental cues for rapid threat detection. This wiring served survival well when base rates were implicitly encoded through repeated personal encounters, but fails in modern environments where statistical information is presented in abstract, unfamiliar probability formats.

IN AI SYSTEMS

How the machines inherit it.

Machine learning classifiers trained on imbalanced datasets (where the target class is rare) can exhibit a form of base rate neglect. A model optimized for overall accuracy can score well while missing most minority-class cases, and a model tuned to catch those rare cases will flag many individuals who do not belong to the target class. Predictive policing algorithms, credit scoring models, and medical AI diagnostic tools are all exposed to this problem when the base rate of the predicted outcome is very low, because most flagged individuals are then false positives.
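
A toy example shows why overall accuracy hides the problem on imbalanced data. The dataset and the two "classifiers" below are made up for illustration (10 positives among 1,000 cases); they are not drawn from any real system.

```python
def metrics(y_true, y_pred):
    """Return (accuracy, precision, recall) for binary labels 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


# Hypothetical data: 10 rare positives followed by 990 negatives.
y_true = [1] * 10 + [0] * 990

# Classifier A never flags anyone.
always_negative = [0] * 1000
# Classifier B catches all 10 positives but also flags 50 negatives.
eager = [1] * 10 + [1] * 50 + [0] * 940

for name, preds in [("always_negative", always_negative), ("eager", eager)]:
    acc, prec, rec = metrics(y_true, preds)
    print(f"{name}: accuracy={acc:.2f} precision={prec:.2f} recall={rec:.2f}")
```

The first classifier reaches 99% accuracy while finding no positives at all; the second finds every positive at 95% accuracy, yet only 1 in 6 of the cases it flags is real. Reporting precision and recall alongside accuracy makes the base rate visible.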
