A test that's 99% accurate just said you have a rare disease. Here's the surprising โ and reassuring โ truth about what that really means.
See why notPicture a disease that just 1 in 1,000 people have, and a test that's right 99% of the time. You test positive. Your gut screams "I'm 99% done for." But the real chance you're sick is closer to 9%.
Nothing is broken about the test. The twist comes from something quietly powerful โ how rare the disease was to begin with. The easiest way to believe it is to watch it happen to a thousand people at once.
Every dot is a person. Drag the two dials and watch the crowd sort itself: the few blue dots truly have the disease, the amber dots got a false alarm. The question that matters isn't "is the test good?" โ it's "if my dot is coloured, which colour is it likely to be?"
Of the 11 people who test positive, only 1 is actually sick.
So a positive result is overwhelmingly a false alarm. Breathe.
The coloured dots at the top are everyone who tested positive. Notice how the amber false alarms can completely swamp the handful of real cases โ that gap is the whole idea.
When almost nobody has the disease, the handful of real cases are easy to drown out. A 99% test still gets it wrong for 1 in every 100 healthy people โ and there are a lot of healthy people.
Out of 1,000 folks where only one is truly sick, that "tiny" 1% error rate produces about ten false alarms. Ten false positives against one real case: even a great test leaves you outnumbered ten to one. Make the disease more common, or the test sharper, and the balance tips back โ exactly what you just felt on the dials.
The same trap springs anywhere you screen a big crowd for something rare. The rarer the thing, the more the alarms are false.
Mammograms, genetic panels, mass disease screens โ a positive is a reason to look closer, not a diagnosis.
Scan millions of harmless bags for the rare dangerous one and most alarms will, thankfully, be false.
Flagging the rare bad email or transaction means the same balancing act between catches and false flags.
Earthquake warnings, fault detectors, AI moderation โ rarity makes "it went off" weaker evidence than it feels.
The trap: it's tempting to read "99% accurate" as "99% sure I'm sick." But those answer two different questions. "How often is the test right?" is about the test. "Given a positive, am I sick?" is about you โ and it depends on how rare the disease was before you ever walked in.
How common is the thing before any test? That's your honest starting point.
A positive pushes your belief up โ by a lot or a little, depending on how good the test is.
You land between the base rate and certainty โ rarely all the way at "definitely."