![Beginning Statistics, 2nd Edition](https://www.bartleby.com/isbn_cover_images/9781932628678/9781932628678_largeCoverImage.gif)
To find:
To calculate the test statistic.
![Check Mark](/static/check-mark.png)
Answer to Problem 15CR
Solution:
a.
b. Chi-Square distribution;
c.
d.
So, there is sufficient evidence not support the claim that education level and region of the U.S are not independent.
Explanation of Solution
Consider the following scenario,
“A sociologist wants to study the eduaction levels of people living in various regios of the United States. He surveys a random sample of 100 people in each of the following regios: Northeast, Southeast, Midwest, and West.”
The results obtained are found in the following table is,
Education Levels | |||||
Less Than a High School Diploma | High School Graduate, No College | Some College or Associate Degree | College Graduate | Total | |
Northeast | 11 | 29 | 45 | 15 | 100 |
Southeast | 19 | 28 | 44 | 9 | 100 |
Midwest | 15 | 25 | 47 | 13 | 100 |
West | 13 | 31 | 42 | 14 | 100 |
Total | 58 | 113 | 178 | 51 | 400 |
Step 1: To state the null and alternative hypotheses.
Let the null hypothesis be that education level and region of the U.S are independent of one another.
Null hypothesis:
Alternative hypothesis:
Step 2: To determine which distribution to use for the test statistic, and state the level of significance.
To determine there is a relationship between education level and region of the U.S. Since we assume that the necessary condition that have been met. So, use the chi-square test for this association and the level of siginificance is
Step 3: To calculate the test statistic.
First calculate the expected value for each cell in the
Formula for calculating expected value of a frequency in a contingency table:
The expected value of the frequency for the ith possible outcome in a contingency table is given by
Where n is the
From the above table represent the value of n is 400.
The
The expected values of for southeast and education level is given below,
The expected values of for mid-west and education level is given below,
The expected values of for west and education level is given below,
The table represents the contingency table of expected values by using the above formula is given below,
Education Levels | |||||
Less Than a High School Diploma | High School Graduate, No College | Some College or Associate Degree | College Graduate | Total | |
Northeast | 14.5 | 28.25 | 44.5 | 12.75 | 100 |
Southeast | 14.5 | 28.25 | 44.5 | 12.75 | 100 |
Midwest | 14.5 | 28.25 | 44.5 | 12.75 | 100 |
West | 14.5 | 28.25 | 44.5 | 12.75 | 100 |
Total | 58 | 113 | 178 | 51 | 400 |
Formula for calculating test statistic for a Chi-Square test for association:
The test statistic for a chi-square test for association is given below,
Where Oi, is the observed frequency for the ith possible outcome and Ei is the expected frequency for the ith possible outcome and n is the sample size.
From the above table represent the value of n is 400.
The test statistic,
The following table represents the chi square test statistic value s given below,
Observed values | Expected Values | |
|
|
11 | 14.5 | -3.5 | 12.25 | 0.84483 |
19 | 14.5 | 4.5 | 20.25 | 1.39655 |
15 | 14.5 | 0.5 | 0.25 | 0.01724 |
13 | 14.5 | -1.5 | 2.25 | 0.15517 |
29 | 28.25 | 0.75 | 0.5625 | 0.01991 |
28 | 28.25 | -0.25 | 0.0625 | 0.00221 |
25 | 28.25 | -3.25 | 10.5625 | 0.37389 |
31 | 28.25 | 2.75 | 7.5625 | 0.2677 |
45 | 44.5 | 0.5 | 0.25 | 0.00562 |
44 | 44.5 | -0.5 | 0.25 | 0.00562 |
47 | 44.5 | 2.5 | 6.25 | 0.14045 |
42 | 44.5 | -2.5 | 6.25 | 0.14045 |
15 | 12.75 | 2.25 | 5.0625 | 0.39706 |
9 | 12.75 | -3.75 | 14.0625 | 1.10294 |
13 | 12.75 | 0.25 | 0.0625 | 0.0049 |
14 | 12.75 | 1.25 | 1.5625 | 0.12255 |
So, the value of test statistic,
Step 4:
Draw a conclusion and interpret the decision.
Degrees of freedom in a Chi-square test for association:
In a chi-square test for association the number of degrees of freedom for the chi-square distribution of the test statistic is given by,
Where R is the number of rows of the data in the contingency table (not including the row total and C is the number of columns of data in the contingency table (not including the column totals).
Rejection Region for Chi-Square Test for Association:
Reject the null hypothesis,
From the given information R = 4 and C = 4.
Substitute the above values in the formula of degrees of freedom to get the following,
The number of degrees of freedom for this test is 9.
Use the level of significance of
Use the “Area to the right of the critical value
The test statistic value is
Comparing the test statistic and critical value to get the following,
So, there is sufficient evidence not support the claim that education level and region of the U.S are not independent.
Final statement:
Thus, there is sufficient evidence not support the claim that education level and region of the U.S are not independent.
Want to see more full solutions like this?
Chapter 10 Solutions
Beginning Statistics, 2nd Edition
- What were the average sales for the four weeks prior to the experiment? What were the sales during the four weeks when the stores used the digital display? What is the mean difference in sales between the experimental and regular POP time periods? State the null hypothesis being tested by the paired sample t-test. Do you reject or retain the null hypothesis? At a 95% significance level, was the difference significant? Explain why or why not using the results from the paired sample t-test. Should the manager of the retail chain install new digital displays in each store? Justify your answer.arrow_forwardA retail chain is interested in determining whether a digital video point-of-purchase (POP) display would stimulate higher sales for a brand advertised compared to the standard cardboard point-of-purchase display. To test this, a one-shot static group design experiment was conducted over a four-week period in 100 different stores. Fifty stores were randomly assigned to the control treatment (standard display) and the other 50 stores were randomly assigned to the experimental treatment (digital display). Compare the sales of the control group (standard POP) to the experimental group (digital POP). What were the average sales for the standard POP display (control group)? What were the sales for the digital display (experimental group)? What is the (mean) difference in sales between the experimental group and control group? List the null hypothesis being tested. Do you reject or retain the null hypothesis based on the results of the independent t-test? Was the difference between the…arrow_forwardQuestion 4 An article in Quality Progress (May 2011, pp. 42-48) describes the use of factorial experiments to improve a silver powder production process. This product is used in conductive pastes to manufacture a wide variety of products ranging from silicon wafers to elastic membrane switches. Powder density (g/cm²) and surface area (cm/g) are the two critical characteristics of this product. The experiments involved three factors: reaction temperature, ammonium percentage, stirring rate. Each of these factors had two levels, and the design was replicated twice. The design is shown in Table 3. A222222222222233 Stir Rate (RPM) Ammonium (%) Table 3: Silver Powder Experiment from Exercise 13.23 Temperature (°C) Density Surface Area 100 8 14.68 0.40 100 8 15.18 0.43 30 100 8 15.12 0.42 30 100 17.48 0.41 150 7.54 0.69 150 8 6.66 0.67 30 150 8 12.46 0.52 30 150 8 12.62 0.36 100 40 10.95 0.58 100 40 17.68 0.43 30 100 40 12.65 0.57 30 100 40 15.96 0.54 150 40 8.03 0.68 150 40 8.84 0.75 30 150…arrow_forward
- - + ++ Table 2: Crack Experiment for Exercise 2 A B C D Treatment Combination (1) Replicate I II 7.037 6.376 14.707 15.219 |++++ 1 བྱ॰༤༠སྦྱོ སྦྱོཋཏྟཱུ a b ab 11.635 12.089 17.273 17.815 с ас 10.403 10.151 4.368 4.098 bc abc 9.360 9.253 13.440 12.923 d 8.561 8.951 ad 16.867 17.052 bd 13.876 13.658 abd 19.824 19.639 cd 11.846 12.337 acd 6.125 5.904 bcd 11.190 10.935 abcd 15.653 15.053 Question 3 Continuation of Exercise 2. One of the variables in the experiment described in Exercise 2, heat treatment method (C), is a categorical variable. Assume that the remaining factors are continuous. (a) Write two regression models for predicting crack length, one for each level of the heat treatment method variable. What differences, if any, do you notice in these two equations? (b) Generate appropriate response surface contour plots for the two regression models in part (a). (c) What set of conditions would you recommend for the factors A, B, and D if you use heat treatment method C = +? (d) Repeat…arrow_forwardQuestion 2 A nickel-titanium alloy is used to make components for jet turbine aircraft engines. Cracking is a potentially serious problem in the final part because it can lead to nonrecoverable failure. A test is run at the parts producer to determine the effect of four factors on cracks. The four factors are: pouring temperature (A), titanium content (B), heat treatment method (C), amount of grain refiner used (D). Two replicates of a 24 design are run, and the length of crack (in mm x10-2) induced in a sample coupon subjected to a standard test is measured. The data are shown in Table 2. 1 (a) Estimate the factor effects. Which factor effects appear to be large? (b) Conduct an analysis of variance. Do any of the factors affect cracking? Use a = 0.05. (c) Write down a regression model that can be used to predict crack length as a function of the significant main effects and interactions you have identified in part (b). (d) Analyze the residuals from this experiment. (e) Is there an…arrow_forwardA 24-1 design has been used to investigate the effect of four factors on the resistivity of a silicon wafer. The data from this experiment are shown in Table 4. Table 4: Resistivity Experiment for Exercise 5 Run A B с D Resistivity 1 23 2 3 4 5 6 7 8 9 10 11 12 I+I+I+I+Oooo 0 0 ||++TI++o000 33.2 4.6 31.2 9.6 40.6 162.4 39.4 158.6 63.4 62.6 58.7 0 0 60.9 3 (a) Estimate the factor effects. Plot the effect estimates on a normal probability scale. (b) Identify a tentative model for this process. Fit the model and test for curvature. (c) Plot the residuals from the model in part (b) versus the predicted resistivity. Is there any indication on this plot of model inadequacy? (d) Construct a normal probability plot of the residuals. Is there any reason to doubt the validity of the normality assumption?arrow_forward
- Stem1: 1,4 Stem 2: 2,4,8 Stem3: 2,4 Stem4: 0,1,6,8 Stem5: 0,1,2,3,9 Stem 6: 2,2 What’s the Min,Q1, Med,Q3,Max?arrow_forwardAre the t-statistics here greater than 1.96? What do you conclude? colgPA= 1.39+0.412 hsGPA (.33) (0.094) Find the P valuearrow_forwardA poll before the elections showed that in a given sample 79% of people vote for candidate C. How many people should be interviewed so that the pollsters can be 99% sure that from 75% to 83% of the population will vote for candidate C? Round your answer to the whole number.arrow_forward
- Suppose a random sample of 459 married couples found that 307 had two or more personality preferences in common. In another random sample of 471 married couples, it was found that only 31 had no preferences in common. Let p1 be the population proportion of all married couples who have two or more personality preferences in common. Let p2 be the population proportion of all married couples who have no personality preferences in common. Find a95% confidence interval for . Round your answer to three decimal places.arrow_forwardA history teacher interviewed a random sample of 80 students about their preferences in learning activities outside of school and whether they are considering watching a historical movie at the cinema. 69 answered that they would like to go to the cinema. Let p represent the proportion of students who want to watch a historical movie. Determine the maximal margin of error. Use α = 0.05. Round your answer to three decimal places. arrow_forwardA random sample of medical files is used to estimate the proportion p of all people who have blood type B. If you have no preliminary estimate for p, how many medical files should you include in a random sample in order to be 99% sure that the point estimate will be within a distance of 0.07 from p? Round your answer to the next higher whole number.arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman
![Text book image](https://www.bartleby.com/isbn_cover_images/9781119256830/9781119256830_smallCoverImage.gif)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781305251809/9781305251809_smallCoverImage.gif)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781305504912/9781305504912_smallCoverImage.gif)
![Text book image](https://www.bartleby.com/isbn_cover_images/9780134683416/9780134683416_smallCoverImage.gif)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781319042578/9781319042578_smallCoverImage.gif)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781319013387/9781319013387_smallCoverImage.gif)