"An algorithm, meanwhile, is a step-by-step recipe for performing a series of actions, and in most cases 'algorithm' means simply 'computer program'." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Big data is revolutionizing the world around us, and it is
easy to feel alienated by tales of computers handing down decisions made in
ways we don’t understand. I think we’re right to be concerned. Modern data
analytics can produce some miraculous results, but big data is often less
trustworthy than small data. Small data can typically be scrutinized; big data
tends to be locked away in the vaults of Silicon Valley. The simple statistical
tools used to analyze small datasets are usually easy to check;
pattern-recognizing algorithms can all too easily be mysterious and
commercially sensitive black boxes." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Each decision about what data to gather and how to analyze
them is akin to standing on a pathway as it forks left and right and deciding
which way to go. What seems like a few simple choices can quickly multiply into
a labyrinth of different possibilities. Make one combination of choices and
you’ll reach one conclusion; make another, equally reasonable, and you might
find a very different pattern in the data." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Each of us is sweating data, and those data are being mopped
up and wrung out into oceans of information. Algorithms and large datasets are
being used for everything from finding us love to deciding whether, if we are
accused of a crime, we go to prison before the trial or are instead allowed to
post bail. We all need to understand what these data are and how they can be
exploited." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Good statistics are not a trick, although they are a kind of
magic. Good statistics are not smoke and mirrors; in fact, they help us see
more clearly. Good statistics are like a telescope for an astronomer, a
microscope for a bacteriologist, or an X-ray for a radiologist. If we are
willing to let them, good statistics help us see things about the world around
us and about ourselves - both large and small - that we would not be able to see in
any other way." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Ideally, a decision maker or a forecaster will combine the outside view and the inside view - or, similarly, statistics plus personal experience. But it’s much better to start with the statistical view, the outside view, and then modify it in the light of personal experience than it is to go the other way around. If you start with the inside view you have no real frame of reference, no sense of scale - and can easily come up with a probability that is ten times too large, or ten times too small." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"If we don’t understand the statistics, we’re likely to be badly mistaken about the way the world is. It is all too easy to convince ourselves that whatever we’ve seen with our own eyes is the whole truth; it isn’t. Understanding causation is tough even with good statistics, but hopeless without them. [...] And yet, if we understand only the statistics, we understand little. We need to be curious about the world that we see, hear, touch, and smell, as well as the world we can examine through a spreadsheet." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"[…] in a world where so many people seem to hold extreme views with strident certainty, you can deflate somebody’s overconfidence and moderate their politics simply by asking them to explain the details." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"It’d be nice to fondly imagine that high-quality statistics
simply appear in a spreadsheet somewhere, divine providence from the numerical heavens.
Yet any dataset begins with somebody deciding to collect the numbers. What
numbers are and aren’t collected, what is and isn’t measured, and who is
included or excluded are the result of all-too-human assumptions,
preconceptions, and oversights." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Making big data work is harder than it seems. Statisticians have spent the past two hundred years figuring out what traps lie in wait when we try to understand the world through data. The data are bigger, faster, and cheaper these days, but we must not pretend that the traps have all been made safe. They have not." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Many people have strong intuitions about whether they would
rather have a vital decision about them made by algorithms or humans. Some
people are touchingly impressed by the capabilities of the algorithms; others
have far too much faith in human judgment. The truth is that sometimes the
algorithms will do better than the humans, and sometimes they won’t. If we want
to avoid the problems and unlock the promise of big data, we’re going to need
to assess the performance of the algorithms on a case-by-case basis. All too
often, this is much harder than it should be. […] So the problem is not the
algorithms, or the big datasets. The problem is a lack of scrutiny,
transparency, and debate." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Much of the data visualization that bombards us today is
decoration at best, and distraction or even disinformation at worst. The
decorative function is surprisingly common, perhaps because the data
visualization teams of many media organizations are part of the art
departments. They are led by people whose skills and experience are not in
statistics but in illustration or graphic design. The emphasis is on the
visualization, not on the data. It is, above all, a picture." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Numbers can easily confuse us when they are unmoored from a
clear definition." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Premature enumeration is an equal-opportunity blunder: the most numerate among us may be just as much at risk as those who find their heads spinning at the first mention of a fraction. Indeed, if you’re confident with numbers you may be more prone than most to slicing and dicing, correlating and regressing, normalizing and rebasing, effortlessly manipulating the numbers on the spreadsheet or in the statistical package - without ever realizing that you don’t fully understand what these abstract quantities refer to. Arguably this temptation lay at the root of the last financial crisis: the sophistication of mathematical risk models obscured the question of how, exactly, risks were being measured, and whether those measurements were something you’d really want to bet your global banking system on." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Sample error reflects the risk that, purely by chance, a randomly chosen sample of opinions does not reflect the true views of the population. The 'margin of error' reported in opinion polls reflects this risk, and the larger the sample, the smaller the margin of error. […] sampling error has a far more dangerous friend: sampling bias. Sampling error is when a randomly chosen sample doesn’t reflect the underlying population purely by chance; sampling bias is when the sample isn’t randomly chosen at all." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Statistical metrics can show us facts and trends that would be impossible to see in any other way, but often they’re used as a substitute for relevant experience, by managers or politicians without specific expertise or a close-up view." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Statisticians are sometimes dismissed as bean counters. The
sneering term is misleading as well as unfair. Most of the concepts that matter
in policy are not like beans; they are not merely difficult to count, but
difficult to define. Once you’re sure what you mean by 'bean', the bean
counting itself may come more easily. But if we don’t understand the
definition, then there is little point in looking at the numbers. We have
fooled ourselves before we have begun."(Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"So information is beautiful - but misinformation can be
beautiful, too. And producing beautiful misinformation is becoming easier than
ever." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"The contradiction between what we see with our own eyes and
what the statistics claim can be very real. […] The truth is more complicated.
Our personal experiences should not be dismissed along with our feelings, at
least not without further thought. Sometimes the statistics give us a vastly
better way to understand the world; sometimes they mislead us. We need to be
wise enough to figure out when the statistics are in conflict with everyday
experience - and in those cases, which to believe." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"The world is full of patterns that are too subtle or too
rare to detect by eyeballing them, and a pattern doesn’t need to be very subtle
or rare to be hard to spot without a statistical lens." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"The whole discipline of statistics is built on measuring or
counting things. […] it is important to understand what is being measured or
counted, and how. It is surprising how rarely we do this. Over the years, as I
found myself trying to lead people out of statistical mazes week after week, I
came to realize that many of the problems I encountered were because people had
taken a wrong turn right at the start. They had dived into the mathematics of a
statistical claim - asking about sampling errors and margins of error, debating
if the number is rising or falling, believing, doubting, analyzing,
dissecting - without taking the ti- me to understand the first and most obvious
fact: What is being measured, or counted? What definition is being used?" (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Those of us in the business of communicating ideas need to go beyond the fact-check and the statistical smackdown. Facts are valuable things, and so is fact-checking. But if we really want people to understand complex issues, we need to engage their curiosity. If people are curious, they will learn." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"Unless we’re collecting data ourselves, there’s a limit to how much we can do to combat the problem of missing data. But we can and should remember to ask who or what might be missing from the data we’re being told about. Some missing numbers are obvious […]. Other omissions show up only when we take a close look at the claim in question." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"We don’t need to become emotionless processors of numerical information - just noticing our emotions and taking them into account may often be enough to improve our judgment. Rather than requiring superhuman control over our emotions, we need simply to develop good habits." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"We filter new information. If it accords with what we
expect, we’ll be more likely to accept it. […] Our brains are always trying to make
sense of the world around us based on incomplete information. The brain makes
predictions about what it expects, and tends to fill in the gaps, often based on
surprisingly sparse data." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"We should conclude nothing because that pair of numbers alone tells us very little. If we want to understand what’s happening, we need to step back and take in a broader perspective." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"[…] when it comes to interpreting the world around us, we need to realize that our feelings can trump our expertise. […] The more extreme the emotional reaction, the harder it is to think straight. […] It is not easy to master our emotions while assessing information that matters to us, not least because our emotions can lead us astray in different directions. […] We often find ways to dismiss evidence that we don’t like. And the opposite is true, too: when evidence seems to support our preconceptions, we are less likely to look too closely for flaws." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)
"When we are trying to understand a statistical claim - any statistical claim - we need to start by asking ourselves what the claim actually means. [...] A surprising statistical claim is a challenge to our existing worldview. It may provoke an emotional response - even a fearful one." (Tim Harford, "The Data Detective: Ten easy rules to make sense of statistics", 2020)