Probability is one of those topics that sounds intimidating until you realize you've been using it your entire life. You check the weather and see "70% chance of rain" — that's probability. You think "there's no way I get called on today" right before your professor calls on you — that's also probability, just the informal kind that happens to be wrong at the worst possible time.
The formal version is actually pretty simple once you get past the notation. This guide covers everything: the basic probability formula, the three fundamental rules, how to handle two events and three events, independent versus dependent events, conditional probability and Bayes' theorem, and a bunch of real-world examples that make the math click. We'll use a free probability calculator throughout so you can check every number as we go.
What Is Probability? The Simple Definition
The probability definition is straightforward: probability is a number between 0 and 1 that measures how likely something is to happen.
- 0 means the event is impossible — it will never happen
- 1 means the event is certain — it will always happen
- Everything in between represents various degrees of likelihood
You'll also see this expressed as a percentage (multiply by 100) or as odds (a ratio like "3 to 1"). They're all the same idea in different clothes.
A probability can never be negative and can never be greater than 1. If your calculation gives you −0.2 or 1.7, you made an arithmetic error somewhere. The entire probability number line runs exactly from 0 to 1, inclusive.
Theoretical vs. Experimental Probability
There are two flavors of probability worth distinguishing:
Theoretical probability is what you calculate based on logic and symmetry, before you run any experiment. If a standard die is fair, the theoretical probability of rolling a 4 is exactly 1/6 ≈ 0.167 — because there's one favorable outcome out of six equally likely ones.
Experimental probability is what you observe after actually running the experiment. You roll a die 600 times. You get a 4 exactly 98 times. The experimental probability is 98/600 ≈ 0.163. That's close to 1/6 but not exactly 1/6 — because real experiments have random variation.
As you run more trials, experimental probability creeps closer and closer to theoretical probability. This is the Law of Large Numbers and it's why casinos can build hotels: they know that over millions of bets, the house edge will reliably show up in the numbers, even though any single player might get lucky on a given night.
The Probability Formula, Explained Step by Step
The basic probability formula is the starting point for everything else:
This works when all outcomes are equally likely. Let's run through a few probability formula examples to make it concrete.
Example 1: Rolling a standard die
What's the probability of rolling an even number (2, 4, or 6)?
Favorable outcomes: {2, 4, 6} = 3
Total outcomes: {1, 2, 3, 4, 5, 6} = 6
P(even) = 3/6 = 0.5 (50%)
Example 2: Drawing a card from a standard deck
What's the probability of drawing a King?
Favorable outcomes: 4 Kings in a 52-card deck
Total outcomes: 52
P(King) = 4/52 = 1/13 ≈ 0.077 (7.7%)
Example 3: Picking a red marble from a bag
A bag contains 3 red, 5 blue, and 2 green marbles (10 total).
P(red) = 3/10 = 0.30 (30%)
Calculate Any Probability Instantly
Our free probability calculator handles single events, multiple events, combinations (nCr), permutations (nPr), and binomial distributions — all with step-by-step formulas shown.
Open Probability Calculator →The Three Basic Probability Rules
Almost every probability problem you'll encounter can be solved using three core rules. Learn these and you can handle 90% of what shows up in a statistics class or real-world scenario.
Rule 1: The Complement Rule
The probability that an event does not happen equals 1 minus the probability that it does.
Example: There's a 30% chance of rain. What's the probability it doesn't rain?
P(no rain) = 1 − 0.30 = 0.70 (70%)
The complement rule is one of the most useful tools in probability because it's often much easier to calculate what won't happen and subtract from 1.
Rule 2: The Addition Rule (Union)
The addition rule gives the probability that event A or event B (or both) occurs:
The reason you subtract P(A and B) is to avoid double-counting outcomes that belong to both events. If A and B are mutually exclusive — they can't both happen at the same time — then P(A and B) = 0 and the formula simplifies.
Example: What's the probability of drawing a heart or a face card from a deck?
P(heart) = 13/52, P(face card) = 12/52, P(heart AND face card) = 3/52 (Jack, Queen, King of hearts)
P(heart or face card) = 13/52 + 12/52 − 3/52 = 22/52 ≈ 0.423 (42.3%)
Rule 3: The Multiplication Rule (Intersection)
The multiplication rule gives the probability that both A and B occur:
When events are independent, knowing A happened tells you nothing new about B — so you just multiply the two probabilities directly.
Probability of Two Events: Every Formula You Need
Two-event probability questions are the workhorses of most stats classes and real-world problems. Here's the complete set of formulas with concrete examples for each.
P(A and B) — Both Events Occur
For independent events: P(A and B) = P(A) × P(B)
For dependent events: P(A and B) = P(A) × P(B|A)
Example (independent): Flip a coin and roll a die. What's the probability of getting heads AND rolling a 6?
P(heads) × P(6) = 0.5 × (1/6) = 1/12 ≈ 0.083
P(A or B) — At Least One Event Occurs
P(A or B) = P(A) + P(B) − P(A and B)
Example: You roll a die. What's the probability of rolling a 3 or an odd number?
P(3) = 1/6, P(odd) = 3/6, P(3 AND odd) = 1/6 (3 is odd, so it's in both sets)
P(3 or odd) = 1/6 + 3/6 − 1/6 = 3/6 = 0.5
P(at least one) — One or More Events Occur
The complement shortcut: P(at least one) = 1 − P(neither occurs)
Example: Two students each independently have a 40% chance of passing a test. What's the probability at least one passes?
P(A' and B') = 0.60 × 0.60 = 0.36
P(at least one passes) = 1 − 0.36 = 0.64 (64%)
P(neither) — No Events Occur
P(neither A nor B) = P(A') × P(B') — for independent events
Using the example above: P(neither passes) = 0.60 × 0.60 = 0.36 (36%)
P(exactly one) — One Event Occurs But Not Both
Example: Same students from above. What's the probability exactly one of them passes?
P(A passes, B fails) + P(A fails, B passes) = (0.40 × 0.60) + (0.60 × 0.40) = 0.24 + 0.24 = 0.48 (48%)
| Scenario | Formula | Example (P(A)=0.4, P(B)=0.4) |
|---|---|---|
| Both A and B | P(A) × P(B) | 0.4 × 0.4 = 0.16 |
| A or B (at least one) | 1 − P(A') × P(B') | 1 − 0.6 × 0.6 = 0.64 |
| Exactly one | P(A)·P(B') + P(A')·P(B) | 0.24 + 0.24 = 0.48 |
| Neither | P(A') × P(B') | 0.6 × 0.6 = 0.36 |
Check your work: All four scenarios (both, A only, B only, neither) must add up to exactly 1.0. If they don't, you made an error. 0.16 + 0.24 + 0.24 + 0.36 = 1.00 ✓
Independent Events vs. Dependent Events
This distinction matters enormously. Get it wrong and every formula that follows falls apart.
What Are Independent Events?
Independent events are events where the outcome of one has absolutely no effect on the probability of the other. The defining test is simple: does knowing the result of A change the probability of B? If no — they're independent.
Examples of independent events:
- Flipping a coin twice — heads on the first flip doesn't change the odds of the second flip
- Rolling two dice simultaneously — die 1 and die 2 don't affect each other
- Whether it rains in Tokyo and whether your local coffee shop runs out of oat milk
- Drawing cards with replacement — you draw a card, note it, put it back, shuffle, draw again
The independent events formula is the simplest in probability:
Example: You roll a fair die three times. What's the probability of getting a 6 all three times?
P(6) × P(6) × P(6) = (1/6)³ = 1/216 ≈ 0.0046 (0.46%)
What Are Dependent Events?
Dependent events are events where the outcome of one does affect the probability of the other. The classic case is drawing cards without replacement — after you remove one card from the deck, the composition of the deck changes, so the probability of the next draw is different.
Examples of dependent events:
- Drawing two aces from a deck without replacement
- Picking two different colored socks from a drawer without looking
- Whether a student passes an exam given that they studied — studying and passing are related
Example: What's the probability of drawing two Aces from a standard 52-card deck without replacement?
P(1st Ace) = 4/52
P(2nd Ace | 1st was Ace) = 3/51 (only 3 Aces and 51 cards remain)
P(both Aces) = (4/52) × (3/51) = 12/2652 ≈ 0.0045 (0.45%)
Conditional Probability: P(A | B)
Conditional probability is the probability of event A occurring given that event B has already occurred. It's written P(A | B), read "the probability of A given B."
How to calculate conditional probability — step by step example:
A bag contains 4 red balls and 6 blue balls. You draw one ball and keep it (it's red). You draw a second ball. What's the probability the second ball is also red?
- P(1st ball is red) = 4/10 = 0.4
- After removing a red ball: 3 red balls, 9 total remain
- P(2nd red | 1st was red) = 3/9 = 0.333 (33.3%)
Another way to check using the formula:
P(both red) = (4/10) × (3/9) = 12/90 ≈ 0.133
P(2nd red | 1st red) = P(both red) ÷ P(1st red) = 0.133 ÷ 0.4 = 0.333 ✓
Bayes' Theorem: Updating Probabilities with Evidence
Bayes' theorem is the crown jewel of conditional probability. It answers the question: "I observed B. What does that tell me about the probability of A?"
It sounds abstract until you see a concrete example. Let's use a medical test:
A disease affects 1% of the population. A test for the disease is 95% accurate: it correctly identifies 95% of people who have it, and correctly identifies 95% of people who don't. You test positive. What's the actual probability you have the disease?
Intuition says 95%. The math says something very different:
- P(disease) = 0.01, P(no disease) = 0.99
- P(positive | disease) = 0.95
- P(positive | no disease) = 0.05 (false positive rate)
- P(positive) = (0.95 × 0.01) + (0.05 × 0.99) = 0.0095 + 0.0495 = 0.059
- P(disease | positive) = (0.95 × 0.01) ÷ 0.059 = 0.0095 ÷ 0.059 ≈ 0.161 (16.1%)
Even with a positive test, there's only a 16% chance you actually have the disease — because the disease is rare, and the false positive rate swamps the true positives. This is called the base rate fallacy, and it's why doctors don't diagnose on a single test result alone. Bayes' theorem is how spam filters, recommendation algorithms, and medical screening protocols all work under the hood.
Run the Numbers Yourself
The probability calculator includes a multi-event tab for P(A and B), P(A or B), and all the two-event scenarios — plus a combinations and permutations tab for counting problems.
Try the Probability Calculator →Probability of Three Events
Three-event problems follow the same logic as two events, just extended one more step. The formulas get longer but the concepts are identical.
P(A and B and C) — All Three Occur
Example: Roll a die three times. Probability of getting a number greater than 4 all three times?
P(>4) = 2/6 = 1/3 on each roll
P(all three >4) = (1/3)³ = 1/27 ≈ 0.037 (3.7%)
P(at least one of A, B, or C)
Use the complement — it's much easier than enumerating all the ways at least one can happen:
Example: Three students each have a 40% chance of passing. What's the probability at least one passes?
P(none pass) = 0.60 × 0.60 × 0.60 = 0.216
P(at least one passes) = 1 − 0.216 = 0.784 (78.4%)
P(none of A, B, C)
P(none) = P(A') × P(B') × P(C') — for independent events. Same as the complement calculation above: 0.216 (21.6%).
P(exactly one of three events)
Using the same student example with P(A)=P(B)=P(C)=0.4:
Each term = 0.40 × 0.60 × 0.60 = 0.144
P(exactly one passes) = 3 × 0.144 = 0.432 (43.2%)
P(exactly two of three events)
Each term = 0.40 × 0.40 × 0.60 = 0.096
P(exactly two pass) = 3 × 0.096 = 0.288 (28.8%)
| Scenario | Formula (P=0.4 each) | Result |
|---|---|---|
| All three occur | 0.4³ | 0.064 |
| Exactly two occur | 3 × (0.4² × 0.6) | 0.288 |
| Exactly one occurs | 3 × (0.4 × 0.6²) | 0.432 |
| None occur | 0.6³ | 0.216 |
Adding all four: 0.064 + 0.288 + 0.432 + 0.216 = 1.000 ✓
Union vs. Intersection: Venn Diagrams Explained
The terminology trips people up more than the math does. Here's the clean version:
Union (A ∪ B) means "A or B" — everything in either circle of a Venn diagram, including the overlap. This is the addition rule.
Intersection (A ∩ B) means "A and B" — only the overlapping region where both events occur. This is the multiplication rule.
In a Venn diagram with two overlapping circles:
- The left circle only = A happens, B does not
- The overlap = both A and B happen
- The right circle only = B happens, A does not
- Outside both circles = neither A nor B happens
The probability union formula for any two events is:
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
And for mutually exclusive events (no overlap in the Venn diagram):
P(A ∪ B) = P(A) + P(B)
This is also called the probability addition rule. "Or" = add, then subtract the overlap you double-counted.
Probability vs. Statistics: The Key Difference
These two words get lumped together constantly, and while they're closely related, they point in opposite directions.
Probability works forward. You have a known model (a fair die, a bag of colored balls, a coin) and you use math to predict what's likely to happen. The model is given; you're computing outcomes.
Statistics works backward. You have observed data and you're trying to infer the underlying model. You flipped a coin 200 times and got 130 heads — is it a fair coin? Statistics gives you the tools to answer that question. The data is given; you're reconstructing what generated it.
Probability is a branch of pure mathematics. Statistics is a data analysis discipline that uses probability as its theoretical foundation. A statistician who doesn't know probability is like an engineer who doesn't know physics — technically possible but not going to end well.
Real-World Probability Examples
Probability isn't just textbook exercises. It shows up everywhere once you start looking.
Probability in Games and Dice
Dice are the canonical probability toy because they're simple and physical. Some useful numbers to know:
| Event | Probability | Decimal |
|---|---|---|
| Rolling a specific number (1d6) | 1/6 | 0.167 |
| Rolling a 7 with 2d6 | 6/36 | 0.167 |
| Rolling a 12 with 2d6 (snake eyes / boxcars) | 1/36 | 0.028 |
| Rolling any doubles with 2d6 | 6/36 | 0.167 |
| Drawing a Royal Flush (5-card poker) | 4/2,598,960 | 0.00000154 |
Probability in Gambling Odds
Casinos and bookmakers express probability as odds rather than decimals — which makes things look better than they are. When you understand the conversion, you can see the house edge in plain numbers.
If a slot machine is advertised with odds of "98 to 2" (meaning 98% return-to-player), you're paying $2 on every $100 in expected value — forever. That's the house edge. It's small per bet but relentless over millions of spins. Probability in gambling is essentially just expected value: the average outcome per bet, calculated using the formula E = P(win) × winnings − P(lose) × stake.
Real Life Probability Examples Beyond Gambling
A few scenarios where probability thinking pays off:
- Weather forecasting: A "70% chance of rain" is an explicit probability statement based on atmospheric models. If the forecaster says 70% rain and it doesn't rain, that's not a bad forecast — it was only supposed to rain 70% of the time in those conditions.
- Quality control: A factory tests whether 2% of its products are defective. If you buy 5 products, the probability at least one is defective is 1 − (0.98)⁵ ≈ 9.6%.
- Medical screening: The Bayes' theorem example from earlier. Understanding base rates prevents overreacting to a single positive test result.
- Insurance pricing: Every premium you pay is calculated from probability models. A 1-in-1,000 annual chance of a $100,000 event costs roughly $100/year to insure before overhead — actuaries just apply the formula at scale.
- Machine learning: Classifiers output probabilities. "This email has a 97% probability of being spam" is a direct application of Bayes' theorem.
Probability Sampling Methods
In statistics, how you sample a population directly affects the probability that your results are unbiased. The main methods:
- Simple random sampling: Every individual has an equal probability of being selected. The gold standard for unbiased estimates.
- Stratified sampling: Divide the population into subgroups (strata), then sample randomly within each. Better representation of minority groups.
- Cluster sampling: Randomly select clusters (schools, neighborhoods), then sample everyone in those clusters. More practical but adds sampling error.
- Systematic sampling: Select every nth individual from a list. Fast and simple, but watch out for hidden patterns in the list that could introduce bias.
The probability that your sample is representative depends on the sampling method and sample size. Larger samples and random selection are the two levers you control.
"A and B" (both occur) → Multiplication rule: P(A) × P(B) for independent, P(A) × P(B|A) for dependent
"A or B" (at least one) → Addition rule: P(A) + P(B) − P(A and B)
"not A" (complement) → 1 − P(A)
"A given B" (conditional) → P(A and B) ÷ P(B)
"at least one" → 1 − P(none occur)