
Perform test of hypotheses that gender and admission are independent for each department at the level of significance of
Prove that the only department for which this hypothesis is rejected is department A.
Also prove that the admission rate for women in department A is significantly higher than the rate for men.

Answer to Problem 4CS
There is enough evidence to conclude that the gender and admission are not independent of gender in department A.
There is not enough evidence to conclude that the gender and admission are not independent of gender in department B.
There is not enough evidence to conclude that the gender and admission are not independent of gender in department C.
There is not enough evidence to conclude that the gender and admission are not independent of gender in department D.
There is not enough evidence to conclude that the gender and admission are not independent of gender in department E.
There is not enough evidence to conclude that the gender and admission are not independent of gender in department F.
Explanation of Solution
Calculation:
The six contingency tables with 2 rows and 2 columns for each of the six departments are given. The two rows consist of gender and the two columns consist of accepted and rejected applicants.
A contingency table is obtained as using two qualitative variables. One of the qualitative variable is row variable that has one category for each row of the table another is column variable has one category for each column of the table.
Step 1:
For Department A:
The hypotheses are:
Null Hypothesis:
Alternate Hypothesis:
Step 2:
Now, it is obtained that,
Gender | Accept | Reject | Row Total |
Male | 512 | 313 | 825 |
Female | 89 | 19 | 108 |
Column Total | 601 | 332 | 933 |
Step 3:
Expected frequencies:
The expected frequencies in case of contingency table is obtained as,
Now, using the formula of expected frequency it is found that the expected frequency for the accepted male is obtained as,
Hence, in similar way the expected frequencies are obtained as,
Gender | Accept | Reject |
Male | ||
Female |
The accept and reject can be rewritten as,
Accept | 1 |
Reject | 2 |
Step 4:
Level of significance:
The level of significance is given as 0.01.
Test Statistic:
Software procedure:
Step -by-step software procedure to obtain test statistic using MINITAB software is as follows:
- Select Stat > Table > Cross Tabulation and Chi-Square.
- Check the box of Raw data (categorical variables).
- Under For rows enter Gender.
- Under For columns enter Column.
- Check the box of Count under Display.
- Under Chi-Square, click the box of Chi-Square test.
- Select OK.
- Output using MINITAB software is given below:
Thus, the value of chi-square statistic is 17.248.
It is known that under the null hypothesis
In the given question there are 2 rows and 2 columns.
Hence, the degrees of freedom is
Thus, the degrees of freedom is 1.
It is known that when the null hypothesis
It is found that all the expected frequencies corresponding to all rows and columns of the given contingency table are more than 5.
Hence, the test of independence is appropriate.
Step 5:
Critical value:
In a test of hypotheses the critical value is the point by which one can reject or accept the null hypothesis.
Software procedure:
Step-by-step software procedure to obtain critical value using MINITAB software is as follows:
- Select Graph > Probability distribution plot > view probability
- Select Chi -Square under distribution.
- In Degrees of freedom, enter 1.
- Choose Probability Value and Right Tail for the region of the curve to shade.
- Enter the Probability value as 0.01 under shaded area.
- Select OK.
- Output using MINITAB software is given below:
Hence, the critical value at
Rejection rule:
If the
Step 6:
Conclusion:
Here, the
That is,
Thus, the decision is “reject the null hypothesis”.
Thus, there is enough evidence to conclude that the gender and admission are not independent of gender in department A.
For Department B:
Step 1:
The hypotheses are:
The hypotheses are:
Null Hypothesis:
Alternate Hypothesis:
Step 2:
Now, it is obtained that,
Gender | Accept | Reject | Row Total |
Male | 353 | 207 | 560 |
Female | 17 | 8 | 25 |
Column Total | 370 | 215 | 585 |
Step 3:
Now, using the formula of expected frequency it is found that the expected frequency for the accepted male is obtained as,
Hence, in similar way the expected frequencies are obtained as,
Gender | Accept | Reject |
Male | ||
Female |
The accept and reject can be rewritten as,
Accept | 1 |
Reject | 2 |
Step 4:
Level of significance:
The level of significance is given as 0.01.
Test Statistic:
Software procedure:
Step -by-step software procedure to obtain test statistic using MINITAB software is as follows:
- Select Stat > Table > Cross Tabulation and Chi-Square.
- Check the box of Raw data (categorical variables).
- Under For rows enter Gender.
- Under For columns enter Column.
- Check the box of Count under Display.
- Under Chi-Square, click the box of Chi-Square test.
- Select OK.
- Output using MINITAB software is given below:
Thus, the value of chi-square statistic is 0.254.
In the given question there are 2 rows and 2 columns.
Hence, the degrees of freedom is
Thus, the degrees of freedom is 1.
It is known that when the null hypothesis
It is found that all the expected frequencies corresponding to all rows and columns of the given contingency table are more than 5.
Hence, the test of independence is appropriate.
Step 5:
Critical value:
In a test of hypotheses the critical value is the point by which one can reject or accept the null hypothesis.
It is already found that the critical value at
Rejection rule:
If the
Step 6:
Conclusion:
Here, the
That is,
Thus, the decision is “fail to reject the null hypothesis”.
Thus, there is not enough evidence to conclude that the gender and admission are not independent of gender in department B.
For Department C:
Step 1:
The hypotheses are:
Null Hypothesis:
Alternate Hypothesis:
Step 2:
Now, it is obtained that,
Gender | Accept | Reject | Row Total |
Male | 120 | 205 | 325 |
Female | 202 | 391 | 593 |
Column Total | 322 | 596 | 918 |
Step 3:
Now, using the formula of expected frequency it is found that the expected frequency for the accepted male is obtained as,
Hence, in similar way the expected frequencies are obtained as,
Gender | Accept | Reject |
Male | ||
Female |
The accept and reject can be rewritten as,
Accept | 1 |
Reject | 2 |
Step 4:
Level of significance:
The level of significance is given as 0.01.
Test Statistic:
Software procedure:
Step -by-step software procedure to obtain test statistic using MINITAB software is as follows:
- Select Stat > Table > Cross Tabulation and Chi-Square.
- Check the box of Raw data (categorical variables).
- Under For rows enter Gender.
- Under For columns enter Column.
- Check the box of Count under Display.
- Under Chi-Square, click the box of Chi-Square test.
- Select OK.
- Output using MINITAB software is given below:
Thus, the value of chi-square statistic is 0.754.
In the given question there are 2 rows and 2 columns.
Hence, the degrees of freedom is
Thus, the degrees of freedom is 1.
It is known that when the null hypothesis
It is found that all the expected frequencies corresponding to all rows and columns of the given contingency table are more than 5.
Hence, the test of independence is appropriate.
Step 5:
Critical value:
In a test of hypotheses the critical value is the point by which one can reject or accept the null hypothesis.
It is already found that the critical value at
Rejection rule:
If the
Step 6:
Conclusion:
Here, the
That is,
Thus, the decision is “fail to reject the null hypothesis”.
Thus, there is not enough evidence to conclude that the gender and admission are not independent of gender in department C.
For Department D:
Step 1:
The hypotheses are:
Null Hypothesis:
Alternate Hypothesis:
Step 2:
Now, it is obtained that,
Gender | Accept | Reject | Row Total |
Male | 138 | 279 | 417 |
Female | 131 | 244 | 375 |
Column Total | 269 | 523 | 792 |
Step 3:
Now, using the formula of expected frequency it is found that the expected frequency for the accepted male is obtained as,
Hence, in similar way the expected frequencies are obtained as,
Gender | Accept | Reject |
Male | ||
Female |
The accept and reject can be rewritten as,
Accept | 1 |
Reject | 2 |
Step 4:
Level of significance:
The level of significance is given as 0.01.
Test Statistic:
Software procedure:
Step -by-step software procedure to obtain test statistic using MINITAB software is as follows:
- Select Stat > Table > Cross Tabulation and Chi-Square.
- Check the box of Raw data (categorical variables).
- Under For rows enter Gender.
- Under For columns enter Column.
- Check the box of Count under Display.
- Under Chi-Square, click the box of Chi-Square test.
- Select OK.
- Output using MINITAB software is given below:
Thus, the value of chi-square statistic is 0.298.
In the given question there are 2 rows and 2 columns.
Hence, the degrees of freedom is
Thus, the degrees of freedom is 1.
It is known that when the null hypothesis
It is found that all the expected frequencies corresponding to all rows and columns of the given contingency table are more than 5.
Hence, the test of independence is appropriate.
Step 5:
Critical value:
In a test of hypotheses the critical value is the point by which one can reject or accept the null hypothesis.
It is already found that the critical value at
Rejection rule:
If the
Step 6:
Conclusion:
Here, the
That is,
Thus, the decision is “fail to reject the null hypothesis”.
Thus, there is not enough evidence to conclude that the gender and admission are not independent of gender in department D.
For Department E:
Step 1:
The hypotheses are:
Null Hypothesis:
Alternate Hypothesis:
Step 2:
Now, it is obtained that,
Gender | Accept | Reject | Row Total |
Male | 53 | 138 | 191 |
Female | 94 | 299 | 383 |
Column Total | 147 | 437 | 584 |
Step 3:
Now, using the formula of expected frequency it is found that the expected frequency for the accepted male is obtained as,
Hence, in similar way the expected frequencies are obtained as,
Gender | Accept | Reject |
Male | ||
Female |
Step 4:
Level of significance:
The level of significance is given as 0.01.
Test Statistic:
Software procedure:
Step -by-step software procedure to obtain test statistic using MINITAB software is as follows:
- Select Stat > Table > Cross Tabulation and Chi-Square.
- Check the box of Raw data (categorical variables).
- Under For rows enter Gender.
- Under For columns enter Column.
- Check the box of Count under Display.
- Under Chi-Square, click the box of Chi-Square test.
- Select OK.
- Output using MINITAB software is given below:
Thus, the value of chi-square statistic is 1.001.
In the given question there are 2 rows and 2 columns.
Hence, the degrees of freedom is
Thus, the degrees of freedom is 1.
It is known that when the null hypothesis
It is found that all the expected frequencies corresponding to all rows and columns of the given contingency table are more than 5.
Hence, the test of independence is appropriate.
Step 5:
Critical value:
In a test of hypotheses the critical value is the point by which one can reject or accept the null hypothesis.
It is already found that the critical value at
Rejection rule:
If the
Step 6:
Conclusion:
Here, the
That is,
Thus, the decision is “fail to reject the null hypothesis”.
Thus, there is not enough evidence to conclude that the gender and admission are not independent of gender in department E.
For Department F:
Step 1:
The hypotheses are:
Null Hypothesis:
Alternate Hypothesis:
Step 2:
Now, it is obtained that,
Gender | Accept | Reject | Row Total |
Male | 22 | 351 | 373 |
Female | 24 | 317 | 341 |
Column Total | 46 | 668 | 714 |
Step 3:
Now, using the formula of expected frequency it is found that the expected frequency for the accepted male is obtained as,
Hence, in similar way the expected frequencies are obtained as,
Gender | Accept | Reject |
Male | ||
Female |
The accept and reject can be rewritten as,
Accept | 1 |
Reject | 2 |
Step 4:
Level of significance:
The level of significance is given as 0.01.
Test Statistic:
Software procedure:
Step -by-step software procedure to obtain test statistic using MINITAB software is as follows:
- Select Stat > Table > Cross Tabulation and Chi-Square.
- Check the box of Raw data (categorical variables).
- Under For rows enter Gender.
- Under For columns enter Column.
- Check the box of Count under Display.
- Under Chi-Square, click the box of Chi-Square test.
- Select OK.
- Output using MINITAB software is given below:
Thus, the value of chi-square statistic is 0.384.
In the given question there are 2 rows and 2 columns.
Hence, the degrees of freedom is
Thus, the degrees of freedom is 1.
It is known that when the null hypothesis
It is found that all the expected frequencies corresponding to all rows and columns of the given contingency table are more than 5.
Hence, the test of independence is appropriate.
Step 5:
Critical value:
In a test of hypotheses the critical value is the point by which one can reject or accept the null hypothesis.
It is already found that the critical value at
Rejection rule:
If the
Step 6:
Conclusion:
Here, the
That is,
Thus, the decision is “fail to reject the null hypothesis”.
Thus, there is not enough evidence to conclude that the gender and admission are not independent of gender in department F.
Thus, it is clear that the only department for which this hypothesis is rejected is department A.
The accepted number of women out of total 108 women is 89.
Hence, the percentage of accepted number of women in department A is,
The accepted number of men out of total 825 men is 512.
Hence, the accepted number of men in department A is,
Thus, it can be concluded that the admission rate for women in department A is significantly higher than the rate for men.
Want to see more full solutions like this?
Chapter 10 Solutions
Essential Statistics
- « CENGAGE MINDTAP Quiz: Chapter 38 Assignment: Quiz: Chapter 38 ips Questions ra1kw08h_ch38.15m 13. 14. 15. O Which sentence has modifiers in the correct place? O a. When called, she for a medical emergency responds quickly. b. Without giving away too much of the plot, Helena described the heroine's actions in the film. O c. Nearly the snakebite victim died before the proper antitoxin was injected. . O O 16 16. O 17. 18. O 19. O 20 20. 21 21. 22. 22 DS 23. 23 24. 25. O O Oarrow_forwardQuestions ra1kw08h_ch36.14m 12. 13. 14. 15. 16. Ӧ 17. 18. 19. OS 20. Two separate sentences need Oa. two separate subjects. Ob. two dependent clauses. c. one shared subject.arrow_forwardCustomers experiencing technical difficulty with their Internet cable service may call an 800 number for technical support. It takes the technician between 30 seconds and 11 minutes to resolve the problem. The distribution of this support time follows the uniform distribution. Required: a. What are the values for a and b in minutes? Note: Do not round your intermediate calculations. Round your answers to 1 decimal place. b-1. What is the mean time to resolve the problem? b-2. What is the standard deviation of the time? c. What percent of the problems take more than 5 minutes to resolve? d. Suppose we wish to find the middle 50% of the problem-solving times. What are the end points of these two times?arrow_forward
- Exercise 6-6 (Algo) (LO6-3) The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.arrow_forward1. Find the mean of the x-values (x-bar) and the mean of the y-values (y-bar) and write/label each here: 2. Label the second row in the table using proper notation; then, complete the table. In the fifth and sixth columns, show the 'products' of what you're multiplying, as well as the answers. X y x minus x-bar y minus y-bar (x minus x-bar)(y minus y-bar) (x minus x-bar)^2 xy 16 20 34 4-2 5 2 3. Write the sums that represents Sxx and Sxy in the table, at the bottom of their respective columns. 4. Find the slope of the Regression line: bi = (simplify your answer) 5. Find the y-intercept of the Regression line, and then write the equation of the Regression line. Show your work. Then, BOX your final answer. Express your line as "y-hat equals...arrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Generate the log of birthweight and family income of children. Name these new variables Ibwght & Ifaminc. Include the output of this code. ii. Apply the command sum with the detail option to the variable faminc. Note: you should find the 25th percentile value, the 50th percentile and the 75th percentile value of faminc from the output - you will need it to answer the next question Include the output of this code. iii. iv. Use the output from part ii of this question to Generate a variable called "high_faminc" that takes a value 1 if faminc is less than or equal to the 25th percentile, it takes the value 2 if faminc is greater than 25th percentile but less than or equal to the 50th percentile, it takes the value 3 if faminc is greater than 50th percentile but less than or equal to the 75th percentile, it takes the value 4 if faminc is greater than the 75th percentile. Include the outcome of this code…arrow_forward
- solve this on paperarrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Apply the command egen to create a variable called "wyd" which is the rowtotal function on variables bwght & faminc. ii. Apply the list command for the first 10 observations to show that the code in part i worked. Include the outcome of this code iii. Apply the egen command to create a new variable called "bwghtsum" using the sum function on variable bwght by the variable high_faminc (Note: need to apply the bysort' statement) iv. Apply the "by high_faminc" statement to find the V. descriptive statistics of bwght and bwghtsum Include the output of this code. Why is there a difference between the standard deviations of bwght and bwghtsum from part iv of this question?arrow_forwardAccording to a health information website, the distribution of adults’ diastolic blood pressure (in millimeters of mercury, mmHg) can be modeled by a normal distribution with mean 70 mmHg and standard deviation 20 mmHg. b. Above what diastolic pressure would classify someone in the highest 1% of blood pressures? Show all calculations used.arrow_forward
- Write STATA codes which will generate the outcomes in the questions & submit the output for each question only when indicated below i. ii. iii. iv. V. Write a code which will allow STATA to go to your favorite folder to access your files. Load the birthweight1.dta dataset from your favorite folder and save it under a different filename to protect data integrity. Call the new dataset babywt.dta (make sure to use the replace option). Verify that it contains 2,998 observations and 8 variables. Include the output of this code. Are there missing observations for variable(s) for the variables called bwght, faminc, cigs? How would you know? (You may use more than one code to show your answer(s)) Include the output of your code (s). Write the definitions of these variables: bwght, faminc, male, white, motheduc,cigs; which of these variables are categorical? [Hint: use the labels of the variables & the browse command] Who is this dataset about? Who can use this dataset to answer what kind of…arrow_forwardApply STATA commands & submit the output for each question only when indicated below İ. ii. iii. iv. V. Apply the command summarize on variables bwght and faminc. What is the average birthweight of babies and family income of the respondents? Include the output of this code. Apply the tab command on the variable called male. How many of the babies and what share of babies are male? Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are white. Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are male but not white. Include the output of this code. Using your answers to previous subparts of this question: What is the difference between the average birthweight of a baby who is male and a baby who is male but not white? What can you say anything about the difference in family income of the babies that are male and male…arrow_forwardA public health researcher is studying the impacts of nudge marketing techniques on shoppers vegetablesarrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman





