Insensitivity to Sample Size

aka Sample Size Neglect · Law of Small Numbers · Sample Size Fallacy

Treating small samples as equally reliable as large ones when judging probabilities, ignoring how much randomness small samples contain.

WHAT IT IS

The glitch, explained plainly.

Imagine you flip a coin 4 times and get 3 heads. You might think 'wow, this coin lands on heads 75% of the time!' But if you flipped it 1,000 times, you'd get much closer to 50/50. We forget that tiny amounts of information can look very different from the big picture, like tasting one grape and deciding the whole vineyard is sour.

Insensitivity to sample size occurs when people fail to appreciate that smaller samples are inherently more variable and less reliable than larger ones, leading them to draw equally strong conclusions regardless of how many data points underlie a result. This bias manifests as treating a finding from 10 observations with the same confidence as one from 10,000 observations. People intuitively expect any random sample, no matter how small, to closely mirror the overall population — a misconception Tversky and Kahneman called 'belief in the law of small numbers.' The result is systematic overconfidence in patterns detected in limited data and a failure to demand more evidence before drawing conclusions.

SOUND FAMILIAR?

Where it shows up.

  1. 01 Eating at a new restaurant once, having a bad meal, and permanently labeling it a terrible restaurant.
  2. 02 Meeting two people from a country and assuming everyone from that country shares their personality traits.
IN DIFFERENT DOMAINS

Where it shows up at work.

The same glitch looks different depending on the terrain. Finance, medicine, a relationship, a team — same mechanism, different costume.

Finance & investing

Investors frequently evaluate mutual funds or stock-picking strategies based on a few months or quarters of returns, treating short-term outperformance as evidence of genuine skill rather than recognizing that small time samples produce high variability and that streaks are expected by chance alone.

Medicine & diagnosis

Clinicians may draw strong conclusions about a treatment's efficacy from a handful of patients they've personally treated, overriding large randomized controlled trials. Similarly, patients dismiss large-scale safety data in favor of a few anecdotal adverse reactions reported by people they know.

HOW TO SPOT IT

Ask yourself…

  • How large is the sample I'm basing this conclusion on — and would I feel equally confident if the sample were half this size or double it?
  • Am I treating a handful of personal experiences or anecdotes as if they were a large, controlled study?
HOW TO DEFEND AGAINST IT

The playbook.

  • Always ask 'What is N?' before drawing conclusions from any statistic — make sample size your first question, not an afterthought.
  • Use the 'shrink the sample' test: mentally reduce the sample to an absurdly small number (e.g., 2 people) and check if your confidence changes. If it does, you're sensitive to sample size — now calibrate properly for the actual N.
FAMOUS CASES

In history.

  • Howard Wainer and Harris Zwerling demonstrated that both the highest and lowest kidney cancer rates in U.S. counties occurred in small, rural counties — not because of environmental factors but because small populations produce more extreme statistical fluctuations.
  • The Gates Foundation's small-schools initiative invested heavily in breaking up large schools into smaller ones after observing that top-performing schools tended to be small, without recognizing that the worst-performing schools were also disproportionately small due to sampling variability.
WHERE IT COMES FROM
Academic origin

Amos Tversky and Daniel Kahneman, 1971 ('Belief in the Law of Small Numbers,' Psychological Bulletin, 76(2), 105–110); further elaborated in their 1974 paper 'Judgment under Uncertainty: Heuristics and Biases' in Science.

Evolutionary origin

In ancestral environments, humans rarely encountered formal statistical data. Decisions were made from small, personally observed samples — a few encounters with a predator, a handful of foraging trips to a location. Treating these small observations as representative was often adaptive because the cost of waiting for a larger sample (e.g., more predator encounters) could be fatal. Quick generalization from limited experience was a survival advantage when data collection itself was dangerous.

IN AI SYSTEMS

How the machines inherit it.

Machine learning models trained on small or unrepresentative datasets can produce overfit predictions that appear highly accurate during training but fail to generalize. Practitioners who evaluate model performance on small test sets may overestimate accuracy. In recommendation systems, items with few ratings can receive extreme average scores (very high or very low) that disproportionately influence recommendations, a problem known as the cold-start problem.

Read more on Wikipedia
FREE FIELD ZINE

10 glitches quietly running your life.

A free field-zine PDF — ten cognitive glitches named, illustrated, with a defense move for each. Plus the weekly Glitch Report on Fridays — one bias named, two spotted in the wild, one defense move. Unsubscribe any time.

EXPLORE MORE

Related glitches.

LAUNCH PRICE

You read about it. Now drill it.

This page taught you the name. The deck turns the name into reflex. 1,100+ swipeable scenarios, 1,100+ defenses, 650+ detection prompts — spaced-repetition Swipe Deck, unlimited Spot-the-Bias Quiz, Defense Playbook, Pre-Flight, My Blindspots, Cheat Sheets, Field Guide e-book. $39.53$59.

Unlock the full kit

Everything below — yours forever. Pay once, use across every device.

Launch price — first 100 readers, $20 off. Auto-applied at checkout.
$59 $39.53
one-time payment · lifetime access
  • All interactive digital cards — search, filter, flip, shuffle on any device
  • Five training modes — Spot-the-Bias Quiz, Swipe Deck, Pre-Flight, Diagnose, Blindspots
  • Curated Lenses + Decision Templates + Defense Playbook
  • Printable Deck PDFs + Field Guide e-book + Cheat Sheets + Anki Export
  • Every future improvement, included
Get the full kit  $39.53

30-day refund · no questions asked

Unlock the full kit

Everything below — yours forever. Pay once, use across every device.

Launch price — first 100 readers, $20 off. Auto-applied at checkout.
$59 $39.53
one-time payment · lifetime access
  • All interactive digital cards — search, filter, flip, shuffle on any device
  • Five training modes — Spot-the-Bias Quiz, Swipe Deck, Pre-Flight, Diagnose, Blindspots
  • Curated Lenses + Decision Templates + Defense Playbook
  • Printable Deck PDFs + Field Guide e-book + Cheat Sheets + Anki Export
  • Every future improvement, included
Get the full kit  $39.53

30-day refund · no questions asked