Estimation Sampling Distribution SECTIONS 31 Sampling Distributions 31 Question of the Day What proportion of M amp M candies are blue The Big Picture Population Sample Sampling ID: 785110
Download The PPT/PDF document "STAT 101 Dr. Kari Lock Morgan" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
STAT 101Dr. Kari Lock Morgan
Estimation: Sampling Distribution
SECTIONS 3.1
Sampling Distributions (3.1)
Slide2Question of the Day
What proportion of M & M candies are
blue?
Slide3The Big Picture
Population
Sample
Sampling
Statistical Inference
Interval estimation
Hypothesis testing
Slide4Statistical InferenceStatistical inference
is the process of drawing conclusions about the entire population based on information in a sample.Example: use the sample of M&Ms candies we have here to draw conclusions about all M&Ms
Slide5Statistic and ParameterA parameter is a number that describes some aspect of a population.
A statistic is a number that is computed from data in a sample.
We usually have a sample statistic and want to use it to make inferences about the population parameter
Slide6M & M Candiesp = proportion of M & M candies that are blue
Get an estimate from one sample.
p
= ???
Slide7The Big Picture
Population
Sample
Sampling
Statistical Inference
PARAMETERS
STATISTICS
Slide8Parameter versus Statistic
mu
sigma
rho
x-bar
p-hat
Slide9Point EstimateWe use the statistic from a sample as a point estimate for a population parameter.
Point estimates will not match population parameters exactly, but they are our best guess, given the data
Slide10How far might the population parameter fall from the sample statistic?
GOAL: Identify an
interval
of plausible values.
p
?
p
?p?
Slide11Key Question and Answer
Key Question: For a given sample statistic, what are plausible values for the population parameter? How far might the true population parameter be from the sample statistic?
Key answer:
It depends on how much the statistic varies from sample to sample!
Slide12More SamplesLet’s collect a few more point estimates!
Important point: Sample statistics vary
from sample to
sample, and knowing how much a statistic varies from sample to sample helps us assess uncertainty in the statistic!
Slide13Lots of Samples
To really see how statistics vary from sample to sample, let’s take lots of samples and compute lots of statistics! (eat lots of M&Ms!)
Enter your sample proportion on the
google
form emailed to you before class (if you don’t have a computer, have someone near you enter your number)
You just made your first sampling distribution!
Slide14Sampling DistributionA sampling distribution is the distribution of sample statistics computed for different samples of the same size from the same population.
A sampling distribution shows us how the sample statistic varies from sample to sample
Slide15Sampling DistributionIn the M & M sampling distribution, what does each dot represent?
One Reese’s piece One sample statistic
Slide16Center and ShapeCenter: If samples are randomly selected, the sampling distribution will be centered around the population parameter.
Shape:
For most of the statistics we consider, if the sample size is large enough the sampling distribution will be symmetric and bell-shaped.
Slide17Sampling Caution
If you take random samples, the sampling distribution will be centered around the true population parameter
If sampling bias exists (if you do not take random samples), your sampling distribution may give you bad information about the true parameter
“The. Polls. Have. Stopped. Making. Any. Sense.”
Slide18We really care about the spread of the statistic…
How much do statistics vary from sample to sample?
Sampling distribution
?
Slide19Standard ErrorThe standard error of a statistic, SE, is the standard deviation of the sample statistic
The standard error measures how much the statistic varies from sample to sample
The standard error can be calculated as the standard deviation of the sampling distribution
Slide20Standard ErrorThe more the statistic varies from sample to sample, the t
he standard error. higher lower
Slide21M & M Standard Error
0.01 0.1 0.2
0.35
Based on our sampling distribution, the standard error is closest to (distribution below is based on 100 samples):
Slide22Lower SE means statistics closer to true parameter value…
p
Distance from parameter to statistic
SE measures “typical” distance between parameter and statistic
SE =
0.1
SE =
0.04
Slide23Distance from parameter to statistic gives distance from statistic to parameter
p
Rare for statistics to be further than this from parameter
So rare for parameter to be further than this from statistic
SE can be used to determine width of interval!
Slide24The larger the SE, the larger the interval
p
SE =
0.1
SE =
0.04
Rare for statistics to be further than this from parameter
SE = 0.1SE =
0.04
Slide25Sample Size Matters!As the sample size increases, the variability of the sample statistics tends to decrease and the sample statistics tend to be closer to the true value of the population parameter
For larger sample sizes, you get less variability in the statistics, so less uncertainty in your estimates
Slide26M & Ms
StatKey
Slide27Sample SizeSuppose we were to take samples of size 10 and samples of size 100 from the same population, and compute the sample means. Which sample means would have the
higher standard error? The sample means using n = 10
The sample means using n = 100
Slide28Slide29So larger n means narrower intervals
Small
n
p
?
Large
n
p?
Slide30SummaryInterval estimates are superior to just point estimates because they account for uncertaintyA sampling distribution is a collection of many statistics from the same population and same
nThe width of the interval depends on how much the statistic varies from sample to sample, measured by the standard error (SE)Larger SE => wider intervalLarger n
=> smaller SE => narrower interval
Slide31To DoRead Section 3.1HW 3.1 due Friday, 10/2