๐Ÿ“Š
๐ŸŽฏ
๐Ÿ”

What Is Statistics, Really?

Tools for thinking clearly about an uncertain world โ€” no formulas required to start

โ†“
Chapter 1

Starting With Questions You Can't Just Answer

You're scrolling through your phone and see a headline: "New study shows eating chocolate improves test scores!" Should you believe it? Should you stock up on candy bars before your next exam?

Or this: Your friend claims their basketball team is "way better" this season. They won 15 games last year and 18 games this year. Is that difference meaningful, or just luck?

These questions have something in common. You can't answer them just by looking. You need a way to think about uncertainty, about patterns, about what's real and what's just random noise. That's what statistics is for.

๐Ÿค” Can You Trust This Claim?
Click each claim to reveal whether statistics can help you evaluate it
"Eating chocolate improves test scores"
Statistics can help! You'd need to check: How many people were studied? Was there a control group? Could something else explain the result? Correlation does not imply causation.
"Our team is way better โ€” 15 wins last year vs 18 this year"
Statistics can evaluate this! Going from 15 to 18 wins might just be normal variation. You'd need to know total games played, strength of opponents, and whether 3 extra wins is statistically significant.
"Our new website design is better than the old one"
Statistics is essential here! Companies run A/B tests โ€” show version A to some users, version B to others, then use statistics to determine if the difference in outcomes is real or just chance.
Chapter 2

Two Big Jobs: Describing and Inferring

Statistics has two main jobs, and they're different enough that they have their own names.

Descriptive statistics is about summarizing and organizing data you already have. You collected 100 test scores โ€” now what? You might say "the average was 78" or "most scores were between 70 and 85." You're describing the data in a way that makes sense.

Inferential statistics is about going beyond your data to make conclusions about the bigger picture. You surveyed 50 students about cafeteria food, but you want to know about all 2,000 students. Inferential statistics helps you make that leap, and tells you how confident you can be.

๐Ÿ”€ Descriptive vs Inferential
Click each example to classify it โ€” is it descriptive or inferential?
Here's the thing: you always start with descriptive statistics. Always. You have to understand what's in your data before you can make bigger claims about the world.
Chapter 3

Why We Need This at All

Let me show you why we can't just "look at the data." Here are test scores from Ms. Johnson's math class โ€” 50 numbers. Try to figure out how the class did just by looking:

๐Ÿ“‹ Ms. Johnson's Test Scores
Click any score to highlight it. Can you tell how the class did just by scanning?
0
Selected
โ€”
Lowest
โ€”
Highest
โ€”
Mean

Your eyes glaze over. There are too many numbers. You might notice a few high scores, a few low ones, but you can't hold all 50 numbers in your head at once. This is why we need ways to summarize โ€” we need to compress information without losing what matters.

Chapter 4

Finding the Center: What's Typical?

The first question you usually ask about data is: what's a typical value? You have three main options, and they each tell you something slightly different.

๐Ÿ“ Mean, Median & Mode
Click each button to see how each measure of center works on the test scores
Mean = Sum of all values รท Number of values
Mean = 4,010 รท 50 = 80.2
Which one should you use? For test scores that are fairly evenly spread, the mean works great. For house prices where a few mansions might skew things, the median is better. For favorite colors or most common shoe sizes, the mode is your only real option.
Chapter 5

Why the Center Isn't Enough

Let's say two classes both averaged 75 on a test. Are they the same? Not necessarily.

โš–๏ธ Same Average, Different Stories
Both classes average 75 โ€” but look at how different the spread is

Class A (tight cluster)

75
Mean
4
Range

Class B (spread out)

75
Mean
55
Range

If you only know the average, you're missing half the story. You need to know about spread โ€” how scattered the values are around the center.

Chapter 6

Understanding Deviation

Spread is about how far values are from the center. The distance from the average is called a deviation. Let's use a tiny dataset so you can see every step.

๐Ÿ“ Step-by-Step: Deviations
Click each step to build toward standard deviation
Scores: 70, 75, 80, 85, 90   |   Mean: ?
ScoreDeviationSquared
70โ€”โ€”
75โ€”โ€”
80โ€”โ€”
85โ€”โ€”
90โ€”โ€”
Standard Deviation = โˆš( Sum of squared deviations รท Number of values )
= โˆš( 250 รท 5 ) = โˆš50 โ‰ˆ 7.07

In plain English: scores typically differ from the average by about 7 points. The standard deviation is in the same units as your original data, making it easy to interpret.

Chapter 7

Seeing Patterns: Distributions

Once you can summarize data with a center and spread, the next question is: what does the overall pattern look like? This is called a distribution โ€” how the values are distributed across the possible range.

๐Ÿ“Š Height Distribution of 100 Students
Watch how grouping data reveals patterns you can't see in raw numbers
4'8"โ€“4'11"
3
5'0"โ€“5'3"
12
5'4"โ€“5'7"
35
5'8"โ€“5'11"
31
6'0"โ€“6'3"
15
6'4"โ€“6'7"
4

Most students are in the middle ranges. Fewer are very short or very tall. Some distributions have most values in the middle (like heights). Some have most values at one end. Understanding the shape helps you know what to expect and what's unusual.

Chapter 8

From Patterns to Probability

Once you see patterns in data, you can start making predictions. In your school's parking lot, 60 out of 100 cars are SUVs, 30 are sedans, and 10 are trucks. If you had to guess what the next car pulling in would be, you'd probably guess SUV โ€” not because you're certain, but because it's most likely based on the pattern.

๐Ÿš— Parking Lot Simulator
Click the buttons to simulate cars arriving. Watch how frequencies approach the true probabilities.
Total cars: 0
๐Ÿš™ SUV (60%)
0%
๐Ÿš— Sedan (30%)
0%
๐Ÿ›ป Truck (10%)
0%
Probability = Number of times something happened รท Total observations
P(SUV) = 60 รท 100 = 0.60 or 60%
This only works if your data is representative. If you only looked at the parking lot on "Truck Tuesday" when all the pickup owners meet up, your estimates would be way off.
Chapter 9

When Two Things Vary Together

Sometimes you want to know if two things are related. Do students who study more hours get higher grades? Do taller basketball players score more points? Correlation measures how tightly points cluster around a straight line, and ranges from -1 to +1.

๐Ÿ“ˆ Explore Correlation
Click a scenario to see the scatter plot and correlation value
X axis Y axis Click a scenario above
Crucial: Ice cream sales and drowning deaths are correlated โ€” not because ice cream causes drowning, but because both happen more in summer. This is why people say "correlation doesn't imply causation."
Chapter 10

Putting It All Together

Let's walk through a complete example. Question: Are students who eat breakfast performing better in first period?

๐Ÿฅฃ Breakfast Study: Step by Step
Click each step to walk through the analysis

The breakfast group averaged 8 points higher and had less variation (standard deviation of 8 vs 12). But is this difference meaningful, or could it be random chance? This is exactly where inferential statistics comes in โ€” you'd use additional tools to determine if the difference is large enough to be confident it's real.

Chapter 11

What Statistics Can and Can't Do

Statistics is powerful, but it has limits. Understanding both sides helps you be a smarter consumer of data.

โœ… Powers & Limits
Green items are things statistics can do. Red items are its limitations.
โœ… Help you summarize large amounts of data
โŒ Prove things with absolute certainty
โœ… Reveal patterns you couldn't see otherwise
โŒ Tell you what causes what (without careful study design)
โœ… Quantify uncertainty (tell you how confident to be)
โŒ Make up for bad data collection
โœ… Help you make decisions with incomplete information
โŒ Answer questions that aren't about data
Garbage in, garbage out. If you collect data poorly โ€” only asking your friends, or only measuring on Mondays, or asking leading questions โ€” no amount of statistical analysis will fix it.
Chapter 12

Why This Matters to You

You encounter statistics constantly, whether you realize it or not. News articles cite studies. Companies use data to make decisions. Social media shows you content based on statistical models of what you'll like.

๐ŸŒ Statistics in Your World
Real ways understanding statistics helps you
๐Ÿ”Ž
Evaluate Claims
Is that study actually good? Is the sample size large enough?
๐Ÿ›ก๏ธ
Avoid Being Misled
Are they cherry-picking data? Confusing correlation with causation?
๐ŸŽฏ
Better Decisions
What does the data actually say about your options?
๐Ÿ’ฌ
Informed Discussions
What do the numbers really mean?

Your Statistics Toolkit

๐Ÿ“
Measures of Center
Mean, median, and mode โ€” three ways to find what's typical
๐Ÿ“
Spread
Range, variance, and standard deviation tell you how scattered values are
๐Ÿ“Š
Distributions
The shape of your data reveals patterns you can't see in raw numbers
๐ŸŽฒ
Probability
Using observed frequencies to estimate how likely things are
๐Ÿ“ˆ
Correlation
Measures whether two things tend to move together (โˆ’1 to +1)
๐Ÿ”
Inference
Going beyond your sample to draw conclusions about the bigger picture

Cite this post

Rizvi, I. (2026). Statistics โ€” A Visual Guide. theog.dev.
DOI: 10.5281/zenodo.19413295

@article{rizvi2026statistics,
  title   = {Statistics โ€” A Visual Guide},
  author  = {Rizvi, Imran},
  year    = {2026},
  url     = {https://theog.dev/ml-ds/statistics-visual.html},
  doi     = {10.5281/zenodo.19413295}
}