STAT 211 Topic 6

From Notes
Jump to navigation Jump to search
Lecture 13 Notes

« previous | Tuesday, March 8, 2011 | next »


Confidence Intervals (Chapter 7)

Confidence interval might not even contain μ

3 Cases for Confidence Intervals with mean (μ) of population:

  1. When population distribution is Normal and σ is known
  2. When we have a large sample (pop. distribution may not be normal and σ may not be known)
  3. When we have a small sample from a normal distribution and σ is unknown


Confidence Interval: A range of values such that we are % sure that the mean of a population lies within that range.


Note: confidence intervals can be random, but μ is fixed: it never changes. Therefore, our confidence interval might not even contain μ.


Case 1: Normal Population Distribution

Given a sample to of size from a normal distribution with (unknown) mean and (known) standard deviation ,
a % Confidence Interval for is:

Where is the sample size and is a value such that the area underneath the Z-curve from to is :

In α notation: . For example, 95% confidence interval will have

Note: A larger sample size will decrease the range of the confidence interval.

Controlling Interval Width

Suppose we want our confidence interval to be at most :

This means that we will have to take at least samples from the population in order to have the same confidence over an interval of width .


Case 2: Large Sample

We do not have a normal population, but we can assume that the distribution of our sample average is approximately normal by CLT. Therefore, we can substitute our sample's standard deviation for the population:


Case 3: Normal Population with Small Sample

Standard deviation σ is unknown.

By standardizing, our sample average approaches Normal distribution, but a small sample size forces us to use T distribution:

, but is "something else"


T distribution

A more spread-out version of the Z-curve. For sample size , has degrees of freedom (df)

Similar to α notation, implies that .


Examples

Apartment Rental Fees

Suppose we want to find the average cost to rent a 1-bedroom apartment in College Station.

Let represent this rental cost. We want to find μ for the population, but note that we do not know the real value of μ or the distribution of .

Suppose we sample 32 apartment facilities and find that our sample average (a statistic) is $450. We can estimate μ with the expression:

Is $450 a good estimate? what about $460? $420?

What if we had an interval—lower to upper bound —for μ such that we are 95% sure that μ is between and ?

This interval is called a confidence interval and can be generated from random sample values.


GPR at A&M

Suppose the average GPR at Texas A&M is 3.0. Dr. Jun decides to inquire students' GPR at a bar on Northgate. She finds with 95% confidence that students who go to bars have a mean GPR between 2.5 and 2.9. What does this mean? (Students that go to bars tend to have lower GPR).


Lecture 14

Lecture 14 Notes

Thursday, March 10, 2011

Confidence Interval of a population proportion for a large sample

  • A population consists of success or failure
  • Population proportion can be estimated:

where is the number of successes in a sample of size .

A 100 × (1 − α)% confidence interval for is:

Thus, to control the width of our confidence interval (where the width is at most ):


Example

A random sample of 539 households from a city was selected, and we determine that 133 of these households owned at least one firearm. Give a 95% confidence interval for the proportion of all households in this city that own at least one firearm.

Sample Variance Distribution

If the population follows a normal distribution, our sample variance in the following form follows a chi-squared distribution with degrees of freedom .

Therefore we can construct a 100 × (1 − α)% confidence interval for the population variance :


χ2 Distribution

Only takes positive values (similar to gamma distribution).

PDF: