What Is Statistics, Really?

Chapter 1

Starting With Questions You Can't Just Answer

You're scrolling through your phone and see a headline: "New study shows eating chocolate improves test scores!" Should you believe it? Should you stock up on candy bars before your next exam?

Or this: Your friend claims their basketball team is "way better" this season. They won 15 games last year and 18 games this year. Is that difference meaningful, or just luck?

These questions have something in common. You can't answer them just by looking. You need a way to think about uncertainty, about patterns, about what's real and what's just random noise. That's what statistics is for.

🤔 Can You Trust This Claim?

Click each claim to reveal whether statistics can help you evaluate it

"Eating chocolate improves test scores"

Statistics can help! You'd need to check: How many people were studied? Was there a control group? Could something else explain the result? Correlation does not imply causation.

"Our team is way better — 15 wins last year vs 18 this year"

Statistics can evaluate this! Going from 15 to 18 wins might just be normal variation. You'd need to know total games played, strength of opponents, and whether 3 extra wins is statistically significant.

"Our new website design is better than the old one"

Statistics is essential here! Companies run A/B tests — show version A to some users, version B to others, then use statistics to determine if the difference in outcomes is real or just chance.

Chapter 2

Two Big Jobs: Describing and Inferring

Statistics has two main jobs, and they're different enough that they have their own names.

Descriptive statistics is about summarizing and organizing data you already have. You collected 100 test scores — now what? You might say "the average was 78" or "most scores were between 70 and 85." You're describing the data in a way that makes sense.

Inferential statistics is about going beyond your data to make conclusions about the bigger picture. You surveyed 50 students about cafeteria food, but you want to know about all 2,000 students. Inferential statistics helps you make that leap, and tells you how confident you can be.

🔀 Descriptive vs Inferential

Click each example to classify it — is it descriptive or inferential?

Here's the thing: you always start with descriptive statistics. Always. You have to understand what's in your data before you can make bigger claims about the world.

Chapter 3

Why We Need This at All

Let me show you why we can't just "look at the data." Here are test scores from Ms. Johnson's math class — 50 numbers. Try to figure out how the class did just by looking:

📋 Ms. Johnson's Test Scores

Click any score to highlight it. Can you tell how the class did just by scanning?

Selected

—

Lowest

—

Highest

—

Mean

Your eyes glaze over. There are too many numbers. You might notice a few high scores, a few low ones, but you can't hold all 50 numbers in your head at once. This is why we need ways to summarize — we need to compress information without losing what matters.

Chapter 4

Finding the Center: What's Typical?

The first question you usually ask about data is: what's a typical value? You have three main options, and they each tell you something slightly different.

📐 Mean, Median & Mode

Click each button to see how each measure of center works on the test scores

Mean = Sum of all values ÷ Number of values

Mean = 4,010 ÷ 50 = 80.2

Which one should you use? For test scores that are fairly evenly spread, the mean works great. For house prices where a few mansions might skew things, the median is better. For favorite colors or most common shoe sizes, the mode is your only real option.

Chapter 5

Why the Center Isn't Enough

Let's say two classes both averaged 75 on a test. Are they the same? Not necessarily.

⚖️ Same Average, Different Stories

Both classes average 75 — but look at how different the spread is

Class A (tight cluster)

Mean

Range

Class B (spread out)

Mean

Range

If you only know the average, you're missing half the story. You need to know about spread — how scattered the values are around the center.

Chapter 6

Understanding Deviation

Spread is about how far values are from the center. The distance from the average is called a deviation. Let's use a tiny dataset so you can see every step.

📏 Step-by-Step: Deviations

Click each step to build toward standard deviation

Scores: 70, 75, 80, 85, 90 | Mean: ?

Score	Deviation	Squared
70	—	—
75	—	—
80	—	—
85	—	—
90	—	—

Standard Deviation = √( Sum of squared deviations ÷ Number of values )

= √( 250 ÷ 5 ) = √50 ≈ 7.07

In plain English: scores typically differ from the average by about 7 points. The standard deviation is in the same units as your original data, making it easy to interpret.

Chapter 7

Seeing Patterns: Distributions

Once you can summarize data with a center and spread, the next question is: what does the overall pattern look like? This is called a distribution — how the values are distributed across the possible range.

📊 Height Distribution of 100 Students

Watch how grouping data reveals patterns you can't see in raw numbers

4'8"–4'11"

5'0"–5'3"

5'4"–5'7"

5'8"–5'11"

6'0"–6'3"

6'4"–6'7"

Most students are in the middle ranges. Fewer are very short or very tall. Some distributions have most values in the middle (like heights). Some have most values at one end. Understanding the shape helps you know what to expect and what's unusual.

Chapter 8

From Patterns to Probability

Once you see patterns in data, you can start making predictions. In your school's parking lot, 60 out of 100 cars are SUVs, 30 are sedans, and 10 are trucks. If you had to guess what the next car pulling in would be, you'd probably guess SUV — not because you're certain, but because it's most likely based on the pattern.

🚗 Parking Lot Simulator

Click the buttons to simulate cars arriving. Watch how frequencies approach the true probabilities.

Total cars: 0

🚙 SUV (60%)

🚗 Sedan (30%)

🛻 Truck (10%)

Probability = Number of times something happened ÷ Total observations

P(SUV) = 60 ÷ 100 = 0.60 or 60%

This only works if your data is representative. If you only looked at the parking lot on "Truck Tuesday" when all the pickup owners meet up, your estimates would be way off.

Chapter 9

When Two Things Vary Together

Sometimes you want to know if two things are related. Do students who study more hours get higher grades? Do taller basketball players score more points? Correlation measures how tightly points cluster around a straight line, and ranges from -1 to +1.

📈 Explore Correlation

Click a scenario to see the scatter plot and correlation value

Crucial: Ice cream sales and drowning deaths are correlated — not because ice cream causes drowning, but because both happen more in summer. This is why people say "correlation doesn't imply causation."

Chapter 10

Putting It All Together

Let's walk through a complete example. Question: Are students who eat breakfast performing better in first period?

🥣 Breakfast Study: Step by Step

Click each step to walk through the analysis

The breakfast group averaged 8 points higher and had less variation (standard deviation of 8 vs 12). But is this difference meaningful, or could it be random chance? This is exactly where inferential statistics comes in — you'd use additional tools to determine if the difference is large enough to be confident it's real.

Chapter 11

What Statistics Can and Can't Do

Statistics is powerful, but it has limits. Understanding both sides helps you be a smarter consumer of data.

✅ Powers & Limits

Green items are things statistics can do. Red items are its limitations.

✅ Help you summarize large amounts of data

❌ Prove things with absolute certainty

✅ Reveal patterns you couldn't see otherwise

❌ Tell you what causes what (without careful study design)

✅ Quantify uncertainty (tell you how confident to be)

❌ Make up for bad data collection

✅ Help you make decisions with incomplete information

❌ Answer questions that aren't about data

Garbage in, garbage out. If you collect data poorly — only asking your friends, or only measuring on Mondays, or asking leading questions — no amount of statistical analysis will fix it.

Chapter 12

Why This Matters to You

You encounter statistics constantly, whether you realize it or not. News articles cite studies. Companies use data to make decisions. Social media shows you content based on statistical models of what you'll like.

🌍 Statistics in Your World

Real ways understanding statistics helps you

🔎

Evaluate Claims

Is that study actually good? Is the sample size large enough?

🛡️

Avoid Being Misled

Are they cherry-picking data? Confusing correlation with causation?

🎯

Better Decisions

What does the data actually say about your options?

💬

Informed Discussions

What do the numbers really mean?

Your Statistics Toolkit

📐

Measures of Center

Mean, median, and mode — three ways to find what's typical

📏

Spread

Range, variance, and standard deviation tell you how scattered values are

📊

Distributions

The shape of your data reveals patterns you can't see in raw numbers

🎲

Probability

Using observed frequencies to estimate how likely things are

📈

Correlation

Measures whether two things tend to move together (−1 to +1)

🔍

Inference

Going beyond your sample to draw conclusions about the bigger picture

What Is Statistics, Really?

Starting With Questions You Can't Just Answer

Two Big Jobs: Describing and Inferring

Why We Need This at All

Finding the Center: What's Typical?

Why the Center Isn't Enough

Class A (tight cluster)

Class B (spread out)

Understanding Deviation

Seeing Patterns: Distributions

From Patterns to Probability

When Two Things Vary Together

Putting It All Together

What Statistics Can and Can't Do

Why This Matters to You

Your Statistics Toolkit

Statistics — A Visual Guide

Cite this post