Probability
Basic Probability Concepts
Sample Space and Events
- Sample space (): The set of all possible outcomes of an experiment
- Event: Any subset of the sample space
- Probability: A number between 0 and 1 that measures the likelihood of an event
Probability Rules
- Legitimate probability values: for any event
- Sum of probabilities: (the probability of the entire sample space is 1)
- Complement rule: , where is the complement of
Addition Rule
For any two events and :
If and are mutually exclusive (disjoint, cannot both occur):
Multiplication Rule
If and are independent:
Conditional Probability
The probability of given that has occurred. Note that must be greater than 0.
Independence
Events and are independent if the occurrence of one does not affect the probability of the other:
Equivalently:
Important: Independence is not the same as mutually exclusive. In fact, if two events are both mutually exclusive and both have non-zero probability, they cannot be independent.
Checking Independence
To check independence on the AP exam:
- Calculate and separately
- Calculate
- Check whether
Alternatively, check whether .
Disjoint vs Independent
| Property | Disjoint (Mutually Exclusive) | Independent |
|---|---|---|
| Definition | Cannot occur together | Occurrence of one does not affect the other |
Bayes’ Theorem
Used to find the probability of a “cause” given an observed “effect.” Useful for medical testing problems.
Two-Way Tables and Probability
Given a two-way table:
- Marginal probability: probability based on a single variable (row or column total / grand total)
- Joint probability: probability of both events occurring (cell / grand total)
- Conditional probability: probability given a condition (cell / row total or cell / column total)
Discrete Random Variables
A random variable assigns a numerical value to each outcome in the sample space.
Probability Distribution
A table, graph, or formula that gives the probability () for each value () of the random variable .
Requirements:
- for each
Mean (Expected Value)
The mean of a random variable is the long-run average of its values over many repetitions.
Variance and Standard Deviation
Rules for Means and Variances
For random variables and , and constants and :
- (always)
- If and are independent: and
Binomial Distributions
A binomial setting has four conditions:
- Binary: Each observation has two possible outcomes (success/failure)
- Independent: Observations are independent (or approximately independent if sampling without replacement and the population is at least 10 times the sample)
- n trials: A fixed number of trials
- p probability: The probability of success is the same for each trial
Binomial Probability
Mean and Standard Deviation
Geometric Distributions
A geometric setting has three conditions:
- Binary: Each trial has two outcomes
- Independent: Trials are independent
- p probability: The probability of success is the same for each trial
The random variable counts the number of trials until the first success.
Geometric Probability
Mean
Normal Distributions and the Central Limit Theorem
Central Limit Theorem (CLT)
For a sufficiently large sample size (typically ), the sampling distribution of is approximately normal, regardless of the shape of the population distribution.
The CLT allows us to use normal probability calculations for sample means even when the population is not normally distributed.
Sampling Distribution of
For a sample proportion with population proportion and sample size :
Approximately normal when and .
Common Pitfalls
- Confusing with
- Assuming events are independent without justification
- Confusing disjoint with independent
- Forgetting to check the 10% condition for the binomial approximation
- Misapplying the central limit theorem with small sample sizes $
- Confusing the mean of a random variable with its most probable value