Statistics and Probability for 11th Graders - Part 5
Statistical Inference and Hypothesis Testing
2026-02-21
What is Statistical Inference?
What is Statistical Inference?
Statistical inference is the process of drawing conclusions about a population based on data from a sample.
Why do we need it?
- We can rarely measure an entire population
- Instead, we collect a sample and use it to estimate population parameters
- We quantify our uncertainty using probability
What is Statistical Inference?
Why do we need it?
| Population mean |
\(\mu\) |
Avg height of all students in PH |
| Sample mean |
\(\bar{x}\) |
Avg height of 50 students surveyed |
| Population proportion |
\(p\) |
% of voters who prefer candidate A |
| Sample proportion |
\(\hat{p}\) |
% in a poll of 200 voters |
Sampling Distributions
A sampling distribution is the distribution of a statistic (like \(\bar{x}\)) computed from many samples of the same size.
Sampling Distributions
A sampling distribution is the distribution of a statistic (like \(\bar{x}\)) computed from many samples of the same size.
Special case of sampling distribution:
Sampling Distributions
Special case of sampling distribution:
The Central Limit Theorem (CLT)
For large enough \(n\), the sampling distribution of \(\bar{x}\) is approximately normal, regardless of the population shape.
\[\bar{x} \sim N\!\left(\mu,\; \frac{\sigma}{\sqrt{n}}\right)\]
- Mean of sampling distribution \(= \mu\)
- Standard error (SE) \(= \dfrac{\sigma}{\sqrt{n}}\)
Sampling Distributions
Special note about CLT
As sample size increases:
- SE gets smaller — estimates are more precise
- Distribution becomes more normal
- Rule of thumb: \(n \geq 30\) is usually “large enough”
Sampling Distribution — Example
A population has mean \(\mu = 70\) and \(\sigma = 10\). We take samples of size \(n = 25\). Find the mean and SE of the sampling distribution.
\[\text{Mean} = \mu = 70\]
\[SE = \frac{\sigma}{\sqrt{n}} = \frac{10}{\sqrt{25}} = \frac{10}{5} = 2\]
So \(\bar{x} \sim N(70,\; 2)\). Most sample means will fall between 66 and 74.
Confidence Intervals
A confidence interval (CI) gives a range of plausible values for a population parameter.
\[\bar{x} \pm z^* \cdot \frac{\sigma}{\sqrt{n}}\]
Common CI levels:
- 90% (\(z^*=1.645\))
- 95% (\(z^*=1.96\))
- 99% (\(z^*=2.576\))
Confidence Intervals
Common Misconception:
- A 95% CI does not mean “there is a 95% chance \(\mu\) is in this interval.”
- It means: if we repeated the process many times, 95% of the intervals we build would contain \(\mu\).
Confidence Intervals – Example
Example:
Confidence Intervals – Example
Example:
A sample of \(n = 36\) students has \(\bar{x} = 68\) kg and \(\sigma = 12\) kg. Construct a 95% CI for the population mean.
\[\bar{x} \pm z^* \cdot \frac{\sigma}{\sqrt{n}} = 68 \pm 1.96 \cdot \frac{12}{\sqrt{36}} = 68 \pm 1.96 \cdot 2 = 68 \pm 3.92\]
\[\boxed{(64.08,\; 71.92)}\]
We are 95% confident the true mean weight is between 64.08 kg and 71.92 kg.
Hypothesis Testing – The Big Picture
Hypothesis testing is a formal procedure to decide whether sample evidence is strong enough to reject a claim about a population.
Hypothesis Testing – The Big Picture
Hypothesis testing is a formal procedure to decide whether sample evidence is strong enough to reject a claim about a population.
Two Hypotheses
- Null hypothesis or \(H_0\): The “nothing is happening” claim; assumed true until proven otherwise
- Alternative hypothesis or \(H_a\) or \(H_1\): What we want to show; the “new” claim
Hypothesis Testing – The Big Picture
Step-by-Step Process
- State \(H_0\) and \(H_a\)
- Choose significance level \(\alpha\)
- Compute the test statistic
- Find the p-value
- Make a decision and conclusion
Hypothesis Testing – The Big Picture
Step-by-Step Process
- State \(H_0\) and \(H_a\)
- Choose significance level \(\alpha\)
- Compute the test statistic
- Find the p-value
- Make a decision and conclusion
\(H_0\) always contains =. \(H_a\) usually opposes \(H_0\), and usually uses \(\neq\), \(<\), or \(>\).
Setting Up Hypotheses
Two-tailed
“Is the mean different from 50?”
\[H_0: \mu = 50\] \[H_a: \mu \neq 50\]
Left-tailed
“Is the mean less than 50?”
\[H_0: \mu = 50\] \[H_a: \mu < 50\]
Right-tailed
“Is the mean greater than 50?”
\[H_0: \mu = 50\] \[H_a: \mu > 50\]
The direction of \(H_a\) determines where the rejection region falls in the distribution.
p-values and Significance
The p-value is the probability of getting a result as extreme as your sample, assuming \(H_0\) is true.
Type I and Type II Errors
When we make a decision, we can be wrong in two ways:
| Reject \(H_0\) |
Type I Error (\(\alpha\)) |
Correct (Power) |
| Fail to Reject \(H_0\) |
Correct |
Type II Error (\(\beta\)) |
Type I and Type II Errors
Remarks:
Type I Error (\(\alpha\))
- Rejecting \(H_0\) when it is actually true
- “False positive”
- Probability \(= \alpha\) (significance level)
- e.g., convicting an innocent person
Type II Error (\(\beta\))
- Failing to reject \(H_0\) when it is actually false
- “False negative”
- Probability \(= \beta\)
- Power \(= 1 - \beta\) = prob. of correctly rejecting \(H_0\)
- e.g., acquitting a guilty person
The Z-Test
Use the Z-test when: population \(\sigma\) is known and \(n \geq 30\) (or population is normal).
The Z-Test
Use the Z-test when: population \(\sigma\) is known and \(n \geq 30\) (or population is normal).
\[z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}\]
The Z-Test
Use the Z-test when: population \(\sigma\) is known and \(n \geq 30\) (or population is normal).
\[z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}\]
A school claims the average test score is \(\mu = 75\). A sample of \(n = 40\) gives \(\bar{x} = 78\), with known \(\sigma = 10\). Test at \(\alpha = 0.05\) (two-tailed).
\[z = \frac{78 - 75}{10/\sqrt{40}} = \frac{3}{1.581} \approx 1.897\]
Critical value: \(z^* = \pm 1.96\). Since \(|1.897| < 1.96\), we fail to reject \(H_0\). There is insufficient evidence that the mean differs from 75.
The T-Test
Use the T-test when: population \(\sigma\) is unknown (use sample \(s\) instead) or \(n < 30\).
\[t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}, \quad df = n - 1\]
The T-Test
Use the T-test when: population \(\sigma\) is unknown (use sample \(s\) instead) or \(n < 30\).
\[t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}, \quad df = n - 1\]
A sample of \(n = 16\) bags shows \(\bar{x} = 498\) g and \(s = 6\) g. Claim: \(\mu = 500\) g. Test at \(\alpha = 0.05\) (left-tailed).
\[t = \frac{498 - 500}{6/\sqrt{16}} = \frac{-2}{1.5} \approx -1.333\]
Critical value (\(df = 15\), left-tailed): \(t^* = -1.753\). Since \(-1.333 > -1.753\), we fail to reject \(H_0\). No significant evidence that bags are underfilled.
Proportion Test
Use a proportion test when the variable is categorical (yes/no, success/failure).
\[z = \frac{\hat{p} - p_0}{\sqrt{\dfrac{p_0(1-p_0)}{n}}}\]
Conditions: \(np_0 \geq 10\) and \(n(1-p_0) \geq 10\)
Proportion Test
Use a proportion test when the variable is categorical (yes/no, success/failure).
\[z = \frac{\hat{p} - p_0}{\sqrt{\dfrac{p_0(1-p_0)}{n}}}\]
Conditions: \(np_0 \geq 10\) and \(n(1-p_0) \geq 10\)
A candidate claims 60% support (\(p_0 = 0.60\)). A poll of \(n = 200\) finds \(\hat{p} = 0.55\). Test at \(\alpha = 0.05\) (left-tailed).
\[z = \frac{0.55 - 0.60}{\sqrt{0.60 \cdot 0.40 / 200}} = \frac{-0.05}{0.03464} \approx -1.443\]
Critical value: \(z^* = -1.645\). Since \(-1.443 > -1.645\), we fail to reject \(H_0\). Not enough evidence that support is below 60%.