« previous | Tuesday, March 8, 2011 | next »
Confidence Intervals (Chapter 7)
Confidence interval might not even contain μ
3 Cases for Confidence Intervals with mean (μ) of population:
- When population distribution is Normal and σ is known
- When we have a large sample (pop. distribution may not be normal and σ may not be known)
- When we have a small sample from a normal distribution and σ is unknown
Confidence Interval: A range of values
such that we are
% sure that the mean
of a population lies within that range.
Note: confidence intervals can be random, but μ is fixed: it never changes. Therefore, our confidence interval might not even contain μ.
Case 1: Normal Population Distribution
Given a sample to of size from a normal distribution with (unknown) mean and (known) standard deviation ,
a % Confidence Interval for is:
Where is the sample size and is a value such that the area underneath the Z-curve from to is :
In α notation: . For example, 95% confidence interval will have
Note: A larger sample size will decrease the range of the confidence interval.
Controlling Interval Width
Suppose we want our confidence interval to be at most :
This means that we will have to take at least samples from the population in order to have the same confidence over an interval of width .
Case 2: Large Sample
We do not have a normal population, but we can assume that the distribution of our sample average is approximately normal by CLT. Therefore, we can substitute our sample's standard deviation for the population:
Case 3: Normal Population with Small Sample
Standard deviation σ is unknown.
By standardizing, our sample average approaches Normal distribution, but a small sample size forces us to use T distribution:
, but is "something else"
T distribution
A more spread-out version of the Z-curve. For sample size , has degrees of freedom (df)
Similar to α notation, implies that .
Examples
Apartment Rental Fees
Suppose we want to find the average cost to rent a 1-bedroom apartment in College Station.
Let represent this rental cost. We want to find μ for the population, but note that we do not know the real value of μ or the distribution of .
Suppose we sample 32 apartment facilities and find that our sample average (a statistic) is $450. We can estimate μ with the expression:
Is $450 a good estimate? what about $460? $420?
What if we had an interval—lower to upper bound —for μ such that we are 95% sure that μ is between and ?
This interval is called a confidence interval and can be generated from random sample values.
GPR at A&M
Suppose the average GPR at Texas A&M is 3.0. Dr. Jun decides to inquire students' GPR at a bar on Northgate. She finds with 95% confidence that students who go to bars have a mean GPR between 2.5 and 2.9. What does this mean? (Students that go to bars tend to have lower GPR).
Lecture 14
Thursday, March 10, 2011
Confidence Interval of a population proportion for a large sample
- A population consists of success or failure
- Population proportion can be estimated:
where is the number of successes in a sample of size .
A 100 × (1 − α)% confidence interval for is:
Thus, to control the width of our confidence interval (where the width is at most ):
Example
A random sample of 539 households from a city was selected, and we determine that 133 of these households owned at least one firearm. Give a 95% confidence interval for the proportion of all households in this city that own at least one firearm.
Sample Variance Distribution
If the population follows a normal distribution, our sample variance in the following form follows a chi-squared distribution with degrees of freedom .
Therefore we can construct a 100 × (1 − α)% confidence interval for the population variance :
χ2 Distribution
Only takes positive values (similar to gamma distribution).
PDF: