Concept explainers
TravelAir.com samples domestic airline flights to explore the relationship between airfare and distance. The service would like to know if there is a
- a. Draw a
scatter diagram with Distance as the independent variable and Fare as the dependent variable. Is the relationship direct or indirect? - b. Compute the
correlation coefficient . At the .05 significance level, is it reasonable to conclude that the correlation coefficient is greater than zero? - c. What percentage of the variation in Fare is accounted for by Distance of a flight?
- d. Determine the regression equation. How much does each additional mile add to the fare? Estimate the fare for a 1,500-mile flight.
- e. A traveler is planning to fly from Atlanta to London Heathrow. The distance is 4,218 miles. She wants to use the regression equation to estimate the fare. Explain why it would not be a good idea to estimate the fare for this international flight with the regression equation.
a.
Construct a scatter diagram with Distance as the independent variable and Fare as the dependent variable.
Explain the relationship between the variables.
Answer to Problem 61CE
The scatter diagram of the data is as follows:
Explanation of Solution
Step-by-step procedure to obtain the scatterplot using MegaStat software:
- In an EXCEL sheet enter the data values of x and y.
- Go to Add-Ins > MegaStat > Correlation/Regression > Scatterplot.
- Enter horizontal axis as Sheet6!$X$1:$X$31 and vertical axis as Sheet6!$Y$1:$Y$31.
- Click on OK.
From the scatterplot of the data indicates an increasing trend. It shows that as the distance increases, the fare also increases. Therefore, there is a positive association between distance and fare.
Thus, the relationship is direct.
b.
Find the correlation coefficient.
Check whether the correlation coefficient is greater than zero.
Answer to Problem 61CE
The correlation coefficient is 0.656.
There is enough evidence to infer that the population correlation is positive.
Explanation of Solution
Step-by-step procedure to obtain the correlation coefficient using MegaStat software:
- In an EXCEL sheet enter the data values of x and y.
- Go to Add-Ins > MegaStat > Correlation/Regression > Correlation matrix.
- Enter Input Range as Sheet6!$X$1:$Y$31.
- Click on OK.
Output obtained using MegaStat is given as follows:
The correlation coefficient is 0.656.
Denote the population correlation as
The hypotheses are given below:
Null hypothesis:
That is, the correlation in the population is less than or equal to zero.
Alternative hypothesis:
That is, the correlation in the population is positive.
Test statistic:
The test statistic is as follows:
Here, the sample size is 30 and the correlation coefficient is 0.656.
The test statistic is as follows:
Degrees of freedom:
The level of significance is 0.05. Therefore,
Critical value:
Step-by-step software procedure to obtain the critical value using EXCEL software:
- Open an EXCEL file.
- In cell A1, enter the formula “=T.INV (0.95, 28)”.
Output obtained using the EXCEL is given as follows:
Decision rule:
Reject the null hypothesis H0, if
Otherwise, fail to reject H0.
Conclusion:
The value of test statistic is 4.599 and the critical value is 1.701.
Here,
By the rejection rule, reject the null hypothesis.
Thus, there is enough evidence to infer that the population correlation is positive.
c.
Explain what percentage of the variation in ‘Fare’ is accounted for by ‘Distance’ of a flight.
Explanation of Solution
The coefficient of determination is the square of correlation coefficient.
From part (b), the correlation coefficient is 0.656.
Thus, the coefficient of determination is 0.43
Thus, about 43% of the variation in fares is explained by the variation in distance.
d.
Find the regression equation.
Explain how much does each additional mile add to the fare.
Find the fare for a 1,500 mile flight.
Answer to Problem 61CE
The regression equation is
The fare for a 1,500 mile flight is $226.1.
Explanation of Solution
Step-by-step procedure to obtain the ‘Regression equation’ using the MegaStat software:
- In an EXCEL sheet enter the data values of x and y.
- Go to Add-Ins > MegaStat > Correlation/Regression > Regression Analysis.
- Select input range as ‘Sheet6!$Y$1:$Y$31’ under Y/Dependent variable.
- Select input range ‘Sheet6!$X$1:$X$31’ under X/Independent variables.
- Click on OK.
Output using the Mega Stat software is given below:
From the output, the regression equation is,
Thus, for each additional mile $0.0527 is added to the fare.
Substitute the value ‘1,500’ for ‘distance’ in the regression equation.
Thus, fare for a 1,500 mile flight is $226.1.
e.
Explain why it is not suitable to estimate the fare for the international flight with the regression equation.
Explanation of Solution
It is given that the distance is 4,218 miles. This flight is far away from the range of the sampled data. Thus, using the regression equation may not be suitable to estimate the fare for the flight.
Want to see more full solutions like this?
Chapter 13 Solutions
EBK STATISTICAL TECHNIQUES IN BUSINESS
- The mean, variance, skewness and kurtosis of a dataset are given as - Mean = 15, Variance = 20, SKewness = 1.5 and Kurtosis = 3.5 calculate the first four raw moments. (Note- Please include as much detailed solution/steps in the solution to understand, Thank you!)arrow_forwardWrite codes to perform the functions in each of these cases i. ii. Apply cd command to tell STATA the filepath associated with your "favorite folder" (use the same name for the favorite folder that we have been using in class) Apply log using command to tell stata that you are creating a log file to record the codes and the outcomes of these codes. Make sure your log file is called loghwa1_W25.smcl. Do not forget to include the replace option. iii. Get help for the "regress" command & include a screenshot of the outcome of this code iv. V. Open a stata file stored in STATA memory called pop2000.dta Continue from question iv. Save this file in your favorite folder (current working directory) using a different name & a replace optionarrow_forwardAre there any unusually high or low pH levels in this sample of wells?arrow_forward
- 0 n AM RIES s of of 10 m Frequency 40 Frequency 20 20 30 10 You make two histograms from two different data sets (see the following figures), each one containing 200 observations. Which of the histograms has a smaller spread: the first or the second? 40 30 20 10 0 20 40 60 0 20 20 40 60 60 80 80 100 80 100arrow_forwardTIP the aren't, the data are not sym 11 Suppose that the average salary at a certain company is $100,000, and the median salary is $40,000. a. What do these figures tell you about the shape of the histogram of salaries at this company? b. Which measure of center is more appro- priate here? c. Suppose that the company goes through a salary negotiation. How can people on each side use these summary statistics to their advantage? 6360 be 52 PART 1 Getting Off to a Statistically Significant Sarrow_forward12 Suppose that you know that a data set is skewed left, and you know that the two measures of center are 19 and 38. Which figure is the mean and which is the median?arrow_forward
- y of 45 home- televisions u find that 010020 le own one, ee, and 1 owns y histogram of 4 Suppose that you have a loaded die. You roll it several times and record the outcomes, which are shown in the following figure. Histogram for Loaded Die 444% 34.00 48% 6% 2% Frequency 20 20 15 155 10 5- ம 0 1 2 3 4 Outcome 5 6 a. Make a relative frequency histogram of these results. b. You can make a relative frequency histo- gram from a frequency histogram; can you go the other direction?arrow_forwardCalculate the mean for Study Hours and Test Scores. Compute the covariance between the two variables using the formula: Calculate the standard deviation for Study Hours (X) and Test Scores (Y). Determine the correlation coefficient Interpret the results: What does the calculated r-value indicate about the relationship between study hours and test scores?arrow_forwardFor unemployed persons in the United States, the average number of months of unemployment at the end of December 2009 was approximately seven months (Bureau of Labor Statistics, January 2010). Suppose the following data are for a particular region in upstate New York. The values in the first column show the number of months unemployed and the values in the second column show the corresponding number of unemployed persons. Months Unemployed Number Unemployed 1 1029 2 1686 3 2269 4 2675 5 3487 6 4652 7 4145 8 3587 9 2325 10 1120 Let x be a random variable indicating the number of months a person is unemployed. a. Use the data to develop an empirical discrete probability distribution for x (to 4 decimals). (x) f(x) 1 2 3 4 5 6 7 8 9 10 b. Show that your probability distribution satisfies the conditions for a valid discrete probability distribution. The input in the box below will not be graded, but may be reviewed and considered by your instructor. blank c. What is the probability that a…arrow_forward
- West Virginia has one of the highest divorce rates in the nation, with an annual rate of approximately 5 divorces per 1000 people (Centers for Disease Control and Prevention website, January 12, 2012). The Marital Counseling Center, Inc. (MCC) thinks that the high divorce rate in the state may require them to hire additional staff. Working with a consultant, the management of MCC has developed the following probability distribution for x = the number of new clients for marriage counseling for the next year. Excel File: data05-19.xls x 10 f(x) .05 20 30 .10 .10 40 .20 50 60 .35 .20 a. Is this probability distribution valid? - Select your answer- Explain. f(x) Σf(x) Select your answer Select your answer b. What is the probability MCC will obtain more than 30 new clients (to 2 decimals)? c. What is the probability MCC will obtain fewer than 20 new clients (to 2 decimals)? d. Compute the expected value and variance of x. Expected value Variance clients per year squared clients per yeararrow_forwardFor unemployed persons in the United States, the average number of months of unemployment at the end of December 2009 was approximately seven months (Bureau of Labor Statistics, January 2010). Suppose the following data are for a particular region in upstate New York. The values in the first column show the number of months unemployed and the values in the second column show the corresponding number of unemployed persons. Months Unemployed Number Unemployed 1 1029 2 1686 3 2269 4 2675 5 3487 6 4652 7 4145 8 3587 9 2325 10 1120 Let x be a random variable indicating the number of months a person is unemployed. a. Use the data to develop an empirical discrete probability distribution for x (to 4 decimals). (x) f(x) 1 2 3 4 5 6 7 8 9 10 b. Show that your probability distribution satisfies the conditions for a valid discrete probability distribution. The input in the box below will not be graded, but may be reviewed and considered by your instructor. c. What is the probability that a person…arrow_forwardIn Gallup's Annual Consumption Habits Poll, telephone interviews were conducted for a random sample of 1014 adults aged 18 and over. One of the questions was "How many cups of coffee, if any, do you drink on an average day?" The following table shows the results obtained (Gallup website, August 6, 2012). Excel File: data05-23.xls Number of Cups per Day Number of Responses 0 365 264 193 3 4 or more 91 101 Define a random variable x = number of cups of coffee consumed on an average day. Let x = 4 represent four or more cups. Round your answers to four decimal places. a. Develop a probability distribution for x. x 0 1 2 3 4 f(x) b. Compute the expected value of x. cups of coffee c. Compute the variance of x. cups of coffee squared d. Suppose we are only interested in adults that drink at least one cup of coffee on an average day. For this group, let y = the number of cups of coffee consumed on an average day. Compute the expected value of y. Compare it to the expected value of x. The…arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin Harcourt
- Functions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage Learning