A November 7-10, 2013, Gallup poll asked 1,039 U.S. adults how much they planned to personally spend on Christmas gifts this year. The report cited an average of $704.
\[ \left( \bar{x} - 2\frac{s}{\sqrt{n}}, \bar{x} + 2\frac{s}{\sqrt{n}} \right) \]
Now work on 1-5 in your table groups.
The Gallup report did not provide \(s\), the sample standard deviation.
What property of the sample does \(s\) measure?
Suppose \(s\) was equal to $150. Use the 2SD method to record a 95% confidence interval for \(\mu\).
Suppose \(s\) was equal to $300. Use the 2SD method to record the 95% confidence interval for \(\mu\).
The poll involved 562 men and 477 women. Suppose the women reported planning to spend an average of $704 with standard deviation $150. Record a 2SD interval for the mean amount that women plan to spend.
Compare the intervals you recorded in #4, #5, and #6. How do sample standard deviation and sample size affect the width of a confidence interval? What else can affect the width of a confidence interval?
Reese’s Pieces candies come in three colors: orange, yellow, and brown. Suppose that you take a random sample of candies and want to estimate the long-run proportion of candies that is orange. Let’s assume for now (although we would not know this when conducting the study) that this long-run proportion, symbolized by \(\pi\), is equal to 0.50.
Thus, 95% confidence means that if we repeatedly sampled from a process and used the sample statistic to construct a 95% confidence interval, in the long run, roughly _______% of all those intervals would manage to capture the actual value of the long-run proportion , and the remaining _______% would not.
Before you change the confidence level to 90%, have your group predict what will happen. Widths? Percent green/red?
Now change Conf level to 90% and Recalculate. Record how the widths of the intervals change. Why does this make sense? Record how the running total changes. Why does this make sense?
The confidence level indicates the long-run percentage of confidence intervals that would succeed in capturing the (unknown) value of the parameter if random samples were to be taken repeatedly from the population/process and a confidence interval produced from each sample.
Before you change the sample size to 400, have your group predict what will happen. Widths? Percent green/red?
Change the sample size to 400 and press Sample. Record how the widths of the intervals change. Why does this make sense? Record how the running total changes. Why does this make sense?
Did people lie to pollsters?
Explainer: Write up your group’s choice, and explain why you made it.
A survey of 47000 Americans found that 32.4% own a cat. Use the Theory-Based Inference applet to test \(H_0: \pi = 0.33333\) versus the alternative \(H_a:\pi < 0.33333\). Record one sentence interpreting the p-value in the context of this problem.
Record a theory-based 95% confidence interval for the proportion of Americans who own a cat. Why is the interval so narrow?
What is the difference between these two explanations? Explainer: Write up your group’s choice and why you made it.
Now suppose that the actual percentage of American households that own a cat is 30%. Use the One Proportion Applet to simulate 1000 samples of size 100 (assuming that the population parameter is 0.3) and make a dot plot of the number of successes for these 1000 samples.
Regard these 1000 samples as 1000 different surveys, each of 100 households. In how many of these surveys would you have rejected the test in #15? (Use the rejection region.) Record the proportion of samples in which your survey count lies in the rejection region.
If the null hypothesis is \(H_0: \pi = 0.33333\), but the actual population parameter is 0.3, is the null true or false?
In how many of your 1000 simulated surveys did you fail to reject the null? If you fail to reject a false null, what type of error have you made? Record the power of this test.