Concept explainers
Alumni donations are an important source of revenue for colleges and universities. If administrators could determine the factors that could lead to increases in the percentage of alumni who make a donation, they might be able to implement policies that could lead to increased revenues. Research shows that students who are more satisfied with their contact with teachers are more likely to graduate. As a result, one might suspect that smaller class sizes and lower student/faculty ratios might lead to a higher percentage of satisfied graduates, which in turn might lead to increases in the percentage of alumni who make a donation. The following table shows data for 48 national universities. The Graduation Rate column is the percentage of students who initially enrolled at the university and graduated. The % of Classes Under 20 column shows the percentages of classes with fewer than 20 students that are offered. The Student/Faculty Ratio column is the number of students enrolled divided by the total number of faculty. Finally, the Alumni Giving Rate column is the percentage of alumni who made a donation to the university.
Managerial Report
- 1. Use methods of
descriptive statistics to summarize the data. - 2. Develop an estimated simple linear regression model that can be used to predict the alumni giving rate, given the graduation rate. Discuss your findings.
- 3. Develop an estimated multiple linear regression model that could be used to predict the alumni giving rate using the Graduation Rate, % of Classes Under 20, and Student/Faculty Ratio as independent variables. Discuss your findings.
- 4. Based on the results in parts (2) and (3), do you believe another regression model may be more appropriate? Estimate this model, and discuss your results.
- 5. What conclusions and recommendations can you derive from your analysis? What universities are achieving a substantially higher alumni giving rate than would be expected, given their Graduation Rate, % of Classes Under 20, and Student/Faculty Ratio? What universities are achieving a substantially lower alumni giving rate than would be expected, given their Graduation Rate, % of Classes Under 20, and Student/Faculty Ratio? What other independent variables could be included in the model?
1.
Summarize the data using the methods of descriptive statistics.
Answer to Problem 1C
The numerical summaries of the data are as follows:
Explanation of Solution
Calculation:
The data is related to the Alumni Giving Rate for 48 national universities.
Descriptive Statistics:
Step-by-step procedure to get the descriptive statistics using EXCEL software:
- Open an EXCEL sheet named as AlumnniGiving.
- Select Data > Data Analysis > Descriptive Statistics.
- Click OK.
- Under Input Range, enter $C$1:$F$49.
- Choose Grouped By as Columns.
- Select Labels in the first row.
- Select Summary statistics.
- Under Output Range, enter $J$1.
- Click OK.
Output obtained using EXCEL software is given below:
According to the output, it is found that mean, standard deviation, and median of the Graduate Rate are 83.042, 8.61, and 83.5, respectively.
The mean, standard deviation, and median of % of classes Under 20 are 55.73, 13.19, and 59.5, respectively.
The mean, standard deviation, and median for the Student–Faculty Ratio are 11.54, 4.85, and 10.5, respectively.
Step-by-step procedure to obtain the correlation coefficient using EXCEL software:
- Select Data > Data Analysis > Descriptive Statistics.
- Click OK.
- Under Input Range, enter $C$1:$F$49.
- Choose Grouped By as Columns.
- Select Labels in the first row.
- Click Ok.
Output obtained using the EXCEL software is given below:
On observing the output of the correlation, it is clear that the graduate rate is positively related with the % of class under 20 and the Alumni Giving Rate and the graduate rate are negatively related with the Student–Faculty Ratio. The % of classes Under 30 is negatively related with the Student–Faculty Ratio and the % of classes Under 20 is positively related with the Alumni Giving Ratio. Finally, the Student–Faculty Ratio is negatively related to the Alumni Giving Rate.
2.
Obtain the estimated simple linear regression model that can be used to predict the alumni-giving rate, given the graduation rate and provide comments about the findings.
Answer to Problem 1C
The estimated regression equation that could be used to predict t, the alumni-giving rateis
Explanation of Solution
Calculation:
In this situation, the Graduation rateis the independent variable
Regression of the Alumni-giving rate using the Graduation rate:
Step-by-step procedure to obtain the regression equation using EXCEL software:
- Select Data>Data Analysis>Regression.
- Click OK.
- Under Input Y Range, enter $F$1:$F$49.
- Under Input X Range, enter $C$1:$C$49.
- Click the box of Labels.
- Under New Worksheet.
- Click OK.
Output obtained using EXCEL software is given as follows:
Thus, the estimated regression equation that could be used to predict the alumni-giving rate, given the graduation rate is
From the output, the slope, b=1.18, which is positive. The positive value of slope depicts that the alumni giving rate is expected to increase or decrease by 1.18 percentage as the graduation rate increases or decreases by one.
R2 (R-squared):
The coefficient of determination (R2) is defined as the proportion of variation in the observed values of the response variable that is explained by the regression. The squared correlation gives the fraction of variability of the response variable (y) accounted for by the linear regression model.
From the output, R square=57.15%.
Thus, only 57.15% variability in the alumni-giving rateis explained by the variability in the graduation rate.
3.
Create an estimated multiple linear regression model that could be used to predict the alumni-given rate using the Graduation Rate, % of Classes Under 20, and Student/Faculty Ratio as independent variables and provide the comments about the findings.
Answer to Problem 1C
The estimated multiple linear regression model that could be used to predict the alumni-given rate using the Graduation Rate, % of Classes Under 20, and Student/Faculty Ratio as independent variables is
Explanation of Solution
Calculation:
In this given problem, the Graduation rate
Step-by-step procedure to obtain the estimated regression equation using EXCEL:
- In Data, select Data Analysis and choose Regression.
- In Input Y Range, select $F$1:$F$49.
- In Input X Range, select $C$1:$E$49.
- Select Labels.
- Click on Confidence Levels and type 95.
- Select the Residuals plot check box.
- Click OK.
Output obtained using EXCEL is given below:
Residual plot for Graduation Rate and % of Classes Under 20:
Thus, the estimated multiple linear regression model that could be used to predict the alumni-given rate using the Graduation Rate, % of Classes Under 20, and Student/Faculty Ratio as independent variables is
Graduation Rate:
From the output, the slope,
% of classes Under 20:
From the output, the slope,
Student–Faculty Ratio:
From the output, the slope,
R2 (R-squared):
From the output, R square=69.99%.
Thus, only 69.99% variability in the alumni-giving rate (response variable) is explained by the explanatory variables in the regression equation.
4.
State whether it is believed that another regression model may be more appropriate, if so, estimate the model and discuss the findings.
Answer to Problem 1C
The estimated multiple linear regression model that could be used to predict the alumni-given rate using the Graduation Rate and Student/Faculty Ratio as independent variables is
The second-order quadratic model is
Explanation of Solution
Calculation:
For Graduation rate:
Consider that
Null hypothesis:
That is, there is no significant relationship between the Alumni giving rate and graduation rate.
Alternative hypothesis:
That is, there is a significant relationship between the Alumni giving rate and graduation rate.
From the output in Part (3), it is found that the t-test statistic corresponding to comfort is 4.51 with p value of approximately 0.00005.
Level of significance:
The assumed level of significance is
Conclusion:
Here, the p value is less than the level of significance.
That is,
Thus, the decision is “reject the null hypothesis”.
Therefore, the data provide sufficient evidence to conclude that there is a significant relationship between the Alumni-giving rate and graduation rate at 0.05 level of significance.
For % of classes under 20:
Consider that
Null hypothesis:
That is, there is no significant relationship between the alumni-giving rate and % of classes under 20.
Alternative hypothesis:
That is, there is a significant relationship between the alumni-giving rate and % of classes under 20.
From the output in Part (a), it is found that the t-test statistic corresponding to % of classes under 20 is 0.2084 with p value of approximately 0.8358.
Conclusion:
Here, the p value is greater than the level of significance.
That is,
Thus, the decision is “fail to reject the null hypothesis”.
Therefore, the data do not provide sufficient evidence to conclude that there is a significant relationship between the alumni-giving rate and % of classes under 20 at 0.05 level of significance.
For Student–Faculty Ratio:
Consider that
Null hypothesis:
That is, there is no significant relationship between the alumni-giving rate and student faculty ratio.
Alternative hypothesis:
That is, there is a significant relationship between the alumni-giving rate and student faculty ratio.
From the output in Part (a), it is found that the t-test statistic corresponding to in-house dining is -3.082 with p value of approximately 0.0035.
Conclusion:
Here, the p value is less than the level of significance.
That is,
Thus, the decision is “reject the null hypothesis”.
Therefore, the data provide sufficient evidence to conclude that there is a significant relationship between the alumni-giving rate and student faculty ratio at 0.01 level of significance.
Now, by comparing the results based on the p-value, it is clear that “% of class under 20” in the regression model in not a good predictor.
In this situation, remove “% of class under 20” variable from the model and rerun the test by considering “Graduation rate” and “student faculty ratio” as the explanatory variable and the “alumni-giving rate” as the response variable.
The estimated regression model will be as given below:
In this given problem, the Graduation rate
Step-by-step procedure to obtain the estimated regression equation using EXCEL:
- In Data, select Data Analysis and choose Regression.
- In Input Y Range, select $F$1:$F$49.
- In Input X Range, select $D$1:$E$49.
- Select Labels.
- Click on Confidence Levels and type 95.
- Select the Residuals plot check box.
- Click OK.
Output obtained using EXCEL is given below:
Residual plot for the Graduation Rate and Student–Faculty Ratio:
Thus, the estimated multiple linear regression model that could be used to predict the alumni-given rate using the Graduation Rate and Student/Faculty Ratio as independent variables is
Graduation Rate:
From the output, the slope,
Student–Faculty Ratio:
From the output, the slope,
R2 (R-squared):
From the output, R square=69.96%.
Thus, only 69.96% variability in the alumni-giving rate (response variable) is explained by the explanatory variables in the regression equation.
Based on output, it is observed that the R-squared value decreases by 0.04% after removing the % of class under 20 from the dataset, which indicates that the relationship between the alumni-giving rate and the graduation rate may be non-linear. Therefore, it is appropriate to estimate the second-order quadratic relationship between the two variables.
The second-order quadratic model will be as given below:
Step-by-step procedure to obtain the second-order quadratic regression equation using EXCEL:
- Insert a new column “Graduate Rate^2” in between C and D.
- In cell D, Enter “=C2^2”.
- Drag the cursor till the end of the dataset.
- In Data, select Data Analysis and choose Regression.
- In Input Y Range, select $F$1:$F$49.
- In Input X Range, select $C$1:$E$49.
- Select Labels.
- Click on Confidence Levels and type 95.
- Select the Residuals plot check box.
- Click OK.
Output obtained using EXCEL is given below:
Residual plot for the Graduation Rate, Graduation Rate^2, and Student–Faculty Ratio:
The second-order quadratic model is as follows:
R2(R-squared):
From the output, R square=75.13%.
Thus, only 75.13% variability in the alumni-giving rate (response variable) is explained by the explanatory variables in the regression equation.
5.
Discuss about the universities that are achieving a substantially higher alumni-giving rate than the expected, given that the graduation rate, % of classes under 20, and student/faculty ratio.
Discuss about the universities that are achieving a substantially loweralumni-giving rate than the expected, given that the graduation rate, % of classes under 20, and student/faculty ratio.
Also explain the other independent variables that could be included in the model.
Explanation of Solution
Using the output corresponding to the second-order quadratic model, the residual explain about the predicts the universities that are achieving a substantially higher and loweralumni giving rate and is given as below:
On observing the results of residual, the Alumni-giving rate at the Wake Forest University, U. of Florida, Dartmouth College, Lehigh University, U. of Notre Dame, Princeton University is approximately greater than 9% from the model predicts. Therefore, the presidents of those Universities will be glad with the determinations of the offices of Alumni Affairs.
Also, the results of residual reveal that the Alumni-giving rate at U. of Washington, Georgetown University, Columbia University, Johns Hopkins University, Stanford University,and North-western University, U. of Michigan–Ann Arbor is below 8% from the predicted model. Therefore, the presidents of those Universities will be suggested to do hard work with the determinations of the offices of Alumni Affairs.
Want to see more full solutions like this?
Chapter 7 Solutions
Essentials of Business Analytics (MindTap Course List)
- In Gallup's Annual Consumption Habits Poll, telephone interviews were conducted for a random sample of 1014 adults aged 18 and over. One of the questions was "How many cups of coffee, if any, do you drink on an average day?" The following table shows the results obtained (Gallup website, August 6, 2012). Excel File: data05-23.xls Number of Cups per Day Number of Responses 0 365 264 193 3 4 or more 91 101 Define a random variable x = number of cups of coffee consumed on an average day. Let x = 4 represent four or more cups. Round your answers to four decimal places. a. Develop a probability distribution for x. x 0 1 2 3 4 f(x) b. Compute the expected value of x. cups of coffee c. Compute the variance of x. cups of coffee squared d. Suppose we are only interested in adults that drink at least one cup of coffee on an average day. For this group, let y = the number of cups of coffee consumed on an average day. Compute the expected value of y. Compare it to the expected value of x. The…arrow_forwardIn Gallup's Annual Consumption Habits Poll, telephone interviews were conducted for a random sample of 1014 adults aged 18 and over. One of the questions was "How many cups of coffee, if any, do you drink on an average day?" The following table shows the results obtained (Gallup website, August 6, 2012). Excel File: data05-23.xls Number of Cups per Day Number of Responses 0 365 264 193 2 3 4 or more 91 101 Define a random variable x = number of cups of coffee consumed on an average day. Let x = 4 represent four or more cups. Round your answers to four decimal places. a. Develop a probability distribution for x. x 0 1 2 3 f(x) b. Compute the expected value of x. cups of coffee c. Compute the variance of x. cups of coffee squared d. Suppose we are only interested in adults that drink at least one cup of coffee on an average day. For this group, let y = the number of cups of coffee consumed on an average day. Compute the expected value of y. Compare it to the expected value of x. The…arrow_forwardA technician services mailing machines at companies in the Phoenix area. Depending on the type of malfunction, the service call can take 1, 2, 3, or 4 hours. The different types of malfunctions occur at about the same frequency. Develop a probability distribution for the duration of a service call. Duration of Call x f(x) 1 2 3 4 Which of the following probability distribution graphs accurately represents the data set? Consider the required conditions for a discrete probability function, shown below.Does this probability distribution satisfy equation (5.1)?Does this probability distribution satisfy equation (5.2)? What is the probability a service call will take three hours? A service call has just come in, but the type of malfunction is unknown. It is 3:00 P.M. and service technicians usually get off at 5:00 P.M. What is the probability the service technician will have to work overtime to fix the machine today?arrow_forward
- A psychologist determined that the number of sessions required to obtain the trust of a new patient is either 1, 2, or 3. Let x be a random variable indicating the number of sessions required to gain the patient's trust. The following probability function has been proposed. x f(x) for x = 1, 2, or 3 a. Consider the required conditions for a discrete probability function, shown below. f(x) ≥0 Σf(x) = 1 (5.1) (5.2) Does this probability distribution satisfy equation (5.1)? Select Does this probability distribution satisfy equation (5.2)? Select b. What is the probability that it takes exactly 2 sessions to gain the patient's trust (to 3 decimals)? c. What is the probability that it takes at least 2 sessions to gain the patient's trust (to 3 decimals)?arrow_forwardA technician services mailing machines at companies in the Phoenix area. Depending on the type of malfunction, the service call can take 1, 2, 3, or 4 hours. The different types of malfunctions occur at about the same frequency. Develop a probability distribution for the duration of a service call. Which of the following probability distribution graphs accurately represents the data set? Consider the required conditions for a discrete probability function, shown below.Does this probability distribution satisfy equation (5.1)?Does this probability distribution satisfy equation (5.2)? What is the probability a service call will take three hours? A service call has just come in, but the type of malfunction is unknown. It is 3:00 P.M. and service technicians usually get off at 5:00 P.M. What is the probability the service technician will have to work overtime to fix the machine today?arrow_forwardWest Virginia has one of the highest divorce rates in the nation, with an annual rate of approximately 5 divorces per 1000 people (Centers for Disease Control and Prevention website, January 12, 2012). The Marital Counseling Center, Inc. (MCC) thinks that the high divorce rate in the state may require them to hire additional staff. Working with a consultant, the management of MCC has developed the following probability distribution for x = the number of new clients for marriage counseling for the next year. Excel File: data05-19.xls 10 20 f(x) .05 .10 11 30 40 50 60 .10 .20 .35 .20 a. Is this probability distribution valid? Yes Explain. greater than or equal to 0 f(x) Σf(x) equal to 1 b. What is the probability MCC will obtain more than 30 new clients (to 2 decimals)? c. What is the probability MCC will obtain fewer than 20 new clients (to 2 decimals)? d. Compute the expected value and variance of x. Expected value Variance clients per year squared clients per yeararrow_forward
- Reconsider the patient satisfaction data in Table 1. Fit a multiple regression model using both patient age and severity as the regressors. (a) Test for significance of regression. (b) Test for the individual contribution of the two regressors. Are both regressor variables needed in the model? (c) Has adding severity to the model improved the quality of the model fit? Explain your answer.arrow_forwardThe output voltage of a power supply is assumed to be normally distributed. Sixteen observations taken at random on voltage are as follows: 10.35, 9.30, 10.00, 9.96, 11.65, 12.00, 11.25, 9.58, 11.54, 9.95, 10.28, 8.37, 10.44, 9.25, 9.38, and 10.85. (a) Test the hypothesis that the mean voltage equals 12 V against a two-sided alternative using a = 0.05. (b) Construct a 95% two-sided confidence interval on μ. (c) Test the hypothesis that σ² = 11 using α = 0.05. (d) Construct a 95% two-sided confidence interval on σ. (e) Construct a 95% upper confidence interval on σ. (f) Does the assumption of normality seem reasonable for the output voltage?arrow_forwardAnalyze the residuals from the regression model on the patient satisfaction data from Exercise 3. Comment on the adequacy of the regression model.arrow_forward
- Consider the hypotheses: Hop=po H₁ppo where 2 is known. Derive a general expression for determining the sample size for detecting a true mean of 1μo with probability 1-ẞ if the type I error is a.arrow_forwardSuppose we wish to test the hypotheses: Họ : | = 15 H₁: 15 where we know that o² = 9.0. If the true mean is really 20, what sample size must be used to ensure that the probability of type II error is no greater than 0.10? Assume that a = 0.05.arrow_forwardTable 1 contains the data from a patient satisfaction survey for a group of 25 randomly selected patients at a hospital. In addition to satisfaction, data were collected on patient age and an index that measured the severity of illness. (a) Fit a linear regression model relating satisfaction to patient age. (b) Test for significance of regression. (c) What portion of the total variability is accounted for by the regressor variable age? Table 1: Patient Satisfaction Data Severity Observation Age (21) (x2) Satisfaction (y) 1 55 50 2 46 24 3 30 46 4 35 48 5 59 58 6 61 60 7 74 65 8 38 42 9 27 42 10 51 50 11 53 38 12 41 30 13 37 31 88 14 24 34 15 42 30 16 50 48 17 58 61 18 60 71 19 62 62 20 68 38 21 70 41 22 79 66 23 63 31 24 39 42 25 49 40 BE225222222222222222 68 77 96 80 43 44 26 88 75 57 56 88 102 88 70 43 46 56 59 26 83 75arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL