
a.
To construct: The 99% confidence interval for the proportion of Americans who are more than 50 years of age and also current smokers.
a.

Answer to Problem 25.42E
The 99% confidence interval for the proportion of Americans who are more than 50 years of age and also current smokers is 0.0826 to 0.1057or 8.26% to 10.57%.
Explanation of Solution
Given info:
The data shows the results obtained by asking two questions to a random sample of 50 year old adults “How do your overall health, excellent, very good, good, fair and poor?” and the second question is “Do you smoke currently, yes or no?”
Calculation:
Let
Thus, the proportion of Americans who are more than 50 years of age and currently smoking is 0.094.
Software procedure:
Step-by-step procedure for constructing 99% confidence interval for the given proportion is shown below:
- Click on Stat, Basic statistics and 1-proportion.
- Choose Summarized data, under Number of
events enter 404, under Number of trials enter 4,310. - Click on options, choose 99% confidence interval.
- Click ok.
Output using MINITAB is given below:
Thus, the 99% confidence interval for the proportion of Americans who are more than who are more than 50 years of age and also current smokers is 0.0826 to 0.1057.
b.
To compare: The two conditional distributions and also the graph of conditional distributions.
To find: The conditional distribution of age for the Americans who use social networking sitesand who don’t use social networking siteson their phones.
To construct: A suitable graph for the conditional distribution of age for the Americans who use social networking sitesand who don’t use social networking siteson their phones.
b.

Answer to Problem 25.42E
- The bar graph and the conditional distributions show that Americans are currently smoking have lowest percentages in “excellent” and “very good” health evaluation whereas Americans who are not currently smoking have highest percentages in “excellent” and “very good” health evaluation.
- The percentages in “fair and poor” health evaluation is more in currently smoking group when compared to the percentages for the same categories under not currently smoking group.
The conditional distribution of health evaluation for the Americans who are currently smoking and not currently smoking is given below:
Conditional distribution of health evaluation for the Americans who are currently smoking and not currently smoking | ||
Health Evaluation | Yes | No |
Excellent | 6.20 | 12.4 |
Very good | 28.50 | 39.9 |
Good | 35.90 | 33.5 |
Fair | 22.30 | 14.0 |
Poor | 7.18 | 0.3 |
The bar graph showing the conditional distribution of health evaluation for the Americans who are currently smoking and not currently smoking is given below:
- Output obtained from MINITAB is given below:
Explanation of Solution
Calculation:
The conditional distribution of health evaluation for the Americans who are current smokers is calculated as follows:
The conditional distribution of health evaluation for the Americans who are not current smokers is calculated as follows:
The conditional distribution of health evaluation for the Americans who are currently smoking and not currently smoking is given below:
Conditional distribution of health evaluation for the Americans who are currently smoking and not currently smoking | ||
Health Evaluation | Yes | No |
Excellent |
|
|
Very good |
|
|
Good |
|
|
Fair |
|
|
Poor |
|
|
Software Procedure:
Step-by-step procedure to construct the Bar Chart using the MINITAB software:
- Choose Graph > Bar Chart.
- From Bars represent, choose Values from a table.
- Under Two Columns of values, choose Cluster. Click OK.
- In Graph variables, enter the columns of Yes and No.
- In Row labels, enter “Health Evaluation”.
- In Table arrangement, click “columns are outermost categories and columns are innermost”.
- Click OK.
- Interpretation:
- The bar chart for comparing two conditional distributions is constructed with bars to the extreme left side of the graph, which represents the currently smoking group.
- c.
- To test: Whether there is a significant difference between health evaluation and currently smoking and not currently smoking group.
- To find: The
mean value of the test statistic given that the null hypothesis is true.
- To give: The P-value.
- c.

Answer to Problem 25.42E
- There is a significant difference between health evaluation and currently smoking and not currently smoking group.
- The mean value of the chi-square test statistic given that the null hypothesis is true is given is to be 4.
- The P-value is 0.000.
Explanation of Solution
- Calculation:
- The claim is to test whether there is any significant difference between health evaluation and currently smoking and not currently smoking group.
Cell frequency for using Chi-square test:
- When at most 20% of the cell frequencies are less than 5
- If all the individual frequencies are 1 or more than 1.
- All the expected frequencies must be 5 or greater than 5
The hypotheses used for testing are given below:
Software procedure:
Step-by-stepprocedure for calculating the chi-square test statistic is given below:
- Click on Stat, select Tables and then click on Chi-square Test (Two-way table in a worksheet).
- Under Columns containing the table: enter the columns of Yes and No.
- Click ok.
Output obtained from MINITAB is given below:
Thus, the test statistic is 229.660, the degree of freedom is 4, and the P-value is 0.000.
Only one cell is having an expected frequency less than 5. The usage of chi-square test is appropriate.
Conclusion:
The P-value is 0.000 and the level of significance is 0.05.
Here, the P-value is less than the level of significance.
Therefore, the null hypothesis is rejected.
Thus, there is sufficient evidence to support the claim that there is a significant difference between health evaluation and currently smoking and not currently smoking group.
Justification:
Fact:
If the null hypothesis is true, then the mean of any chi-square distribution is equal to its degrees of freedom.
From the MINITAB output, it can be observed that the degree of freedom is calculated as 3. So, the mean value of the chi-square statistic is 3.
Thus, the mean value of the chi-square statistic is 3.
Also, the observed chi-square value is
d.
To give: A comparison for the difference in the distribution of age across social media network usage.
d.

Explanation of Solution
Comparison:
From the MINITAB output, it can be seen that the observed frequencies for good, fair and poor health evaluation for currently smoking group is higher than the expected frequency, given that the null hypothesis is true.
In non-smoking group the observed frequencies for good, fair and poor health evaluation is lesser than the expected frequency.
The people who are currently smoking have a poor health condition when compared to the non smoking group.
Hence, there is a significant difference in the health evaluation for currently smoking and not currently smoking group.
Want to see more full solutions like this?
Chapter 25 Solutions
Loose-leaf Version for The Basic Practice of Statistics 7e & LaunchPad (Twelve Month Access)
- Suppose that you take a sample of 100 from a population that contains 45 percent Democrats. What sample size condition do you need to check here (if any)?What’s the standard error of ^P?Compare the standard errors of ^p n=100 for ,n=1000 , n=10,000, and comment.arrow_forwardSuppose that a class’s test scores have a mean of 80 and standard deviation of 5. You choose 25 students from the class. What’s the chance that the group’s average test score is more than 82?arrow_forwardSuppose that you collect data on 10 products and check their weights. The average should be 10 ounces, but your sample mean is 9 ounces with standard deviation 2 ounces. Find the standard score.What percentile is the standard score found in part a of this question closest to?Suppose that the mean really is 10 ounces. Do you find these results unusual? Use probabilities to explain.arrow_forward
- Suppose that you want to sample expensive computer chips, but you can have only n=3 of them. Should you continue the experiment?arrow_forwardSuppose that studies claim that 40 percent of cellphone owners use their phones in the car while driving. What’s the chance that more than 425 out of a random sample of 1,000 cellphone owners say they use their phones while driving?arrow_forwardSuppose that the average length of stay in Europe for American tourists is 17 days, with standard deviation 4.5. You choose a random sample of 16 American tourists. The sample of 16 stay an average of 18.5 days or more. What’s the chance of that happening?arrow_forward
- How do you recognize that a statistical problem requires you to use the CLT? Think of one or two clues you can look for. (Assume quantitative data.)arrow_forwardSuppose that you take a sample of 100 from a skewed population with mean 50 and standard deviation 15. What sample size condition do you need to check here (if any)?What’s the shape and center of the sampling distribution for ?What’s the standard error?arrow_forwardQuestion 3 The following stem-and-leaf displays the weekly salary of employees at this firm. Stem-and-Leaf Display Leaf Unit = 10.0 N=x 5 3 00123 12 4 0125888 (y) 5 11234456777 z 6 13568 5 7 154 2 8 46 i. Determine the value of x, y and z. [3] ii. What is the value of the median? [2] iii. Find the mode of this data set. iv. Calculate the range [1] [2]arrow_forward
- Let Y be a continuous RV with PDF otherwise Find the CDF, Fry), of Y . Find an expression for pth, p € (0, 1), quantile of the distribution. Find E(Y) and V(Y). Find E(-2Y + 1) and V(-3Y - 2). Find E(Y3).arrow_forwardLet X be a continuous RV with CDF Find P(X < 0), P(-1 < X < 1) and P(0.5 < X). Based on your answers to the above questions, what is the median of the distribu-tion? Why Find the PDF, fx (x), of X.arrow_forwardA survey of 581 citizens found that 313 of them favor a new bill introduced by the city. We want to find a 95% confidence interval for the true proportion of the population who favor the bill. What is the lower limit of the interval? Enter the result as a decimal rounded to 3 decimal digits. Your Answer:arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman





