Concept explainers
A sample containing years to maturity and yield (%) for 40 corporate bonds is contained in the data file named CorporateBonds (Barron’s, April 2, 2012).
- a. Develop a
scatter diagram of the data using x = years to maturity as the independent variable. Does a simple linear regression model appear to be appropriate? - b. Develop an estimated regression equation with x = years to maturity and x2 as the independent variables.
- c. As an alternative to fitting a second-order model, fit a model using the natural logarithm of price as the independent variable; that is, ŷ = b0 + b1ln(x). Does the estimated regression using the natural logarithm of x provide a better fit than the estimated regression developed in part (b)? Explain.
a.
Construct a scatter diagram of the data using
Decide whether a simple linear regression model appears to be appropriate.
Answer to Problem 29SE
The scatter diagram of the data using
A simple linear regression model does not appear to be appropriate.
Explanation of Solution
Calculation:
The data gives information on yield (%) of 40 corporate bonds and the respective years to maturity.
Scatterplot:
Software procedure:
Step by step procedure to draw scatter diagram using MINITAB software is given below:
- Choose Graph > Scatterplot.
- Choose Simple, and then click OK.
- In Y–variables, enter the column of Yield.
- In X–variables enter the column of Years.
- Click OK.
Observation:
The scatterplot shows a gradual increase in the yield, at a decreasing rate, with increase in years up to 25. After this, there is a reduction in the values of yield. Thus, a simple linear regression model does not appear to be appropriate.
b.
Develop an estimated multiple regression equation with
Answer to Problem 29SE
The estimated multiple regression equation with
Explanation of Solution
Calculation:
Square transformation:
Software procedure:
Step by step procedure to make square transformation using MINITAB software is given as,
- Choose Calc > Calculator.
- In Store result in variable, enter YearsSq.
- In Expression, enter ‘Years’^2.
- Click OK.
The squared variable is stored in the column of ‘YearsSq’.
Regression:
Software procedure:
Step by step procedure to obtain the regression equation using MINITAB software:
- Choose Stat > Regression > General Regression.
- Under Responses, enter the column of Yield.
- Under Model, enter the columns of Years, YearsSq.
- Click OK.
Output using MINITAB software is given below:
From the output, the estimated multiple regression equation with
c.
Develop an estimated multiple regression equation using the natural logarithm of years as the independent variable.
Explain whether the current regression provides a better fit than the estimated regression developed in part b.
Answer to Problem 29SE
The estimated multiple regression equation using the natural logarithm of years as the independent variable is:
The estimated regression using the natural logarithm of x provides a better fit than the estimated regression developed in part b.
Explanation of Solution
Calculation:
Logarithmic transformation:
Software procedure:
Step by step procedure to make logarithmic transformation using MINITAB software is given as,
- Choose Calc > Calculator.
- In Store result in variable, enter Years.
- In Expression, enter ln(‘Years’).
- Click OK.
The logarithm of the variable is stored in the column of ‘ln(‘Years’)’.
Regression:
Software procedure:
Step by step procedure to obtain the regression equation using MINITAB software:
- Choose Stat > Regression > General Regression.
- Under Responses, enter the column of Yield.
- Under Model, enter the columns of ln(Years).
- Click OK.
Output using MINITAB software is given below:
From the output, the estimated multiple regression equation using the natural logarithm of years as the independent variable is:
Adjusted-
The adjusted
The value of adjusted
The value of adjusted
Evidently, the current regression equation effectively explains more of the variation the response variable, than the second regression equation.
Thus, the estimated regression using the natural logarithm of x provides a better fit than the estimated regression developed in part b.
Want to see more full solutions like this?
Chapter 16 Solutions
Bundle: Statistics for Business & Economics, Loose-Leaf Version, 13th + MindTap Business Statistics with XLSTAT, 1 term (6 months) Printed Access Card
- Find the equation of the regression line for the following data set. x 1 2 3 y 0 3 4arrow_forwardOlympic Pole Vault The graph in Figure 7 indicates that in recent years the winning Olympic men’s pole vault height has fallen below the value predicted by the regression line in Example 2. This might have occurred because when the pole vault was a new event there was much room for improvement in vaulters’ performances, whereas now even the best training can produce only incremental advances. Let’s see whether concentrating on more recent results gives a better predictor of future records. (a) Use the data in Table 2 (page 176) to complete the table of winning pole vault heights shown in the margin. (Note that we are using x=0 to correspond to the year 1972, where this restricted data set begins.) (b) Find the regression line for the data in part ‚(a). (c) Plot the data and the regression line on the same axes. Does the regression line seem to provide a good model for the data? (d) What does the regression line predict as the winning pole vault height for the 2012 Olympics? Compare this predicted value to the actual 2012 winning height of 5.97 m, as described on page 177. Has this new regression line provided a better prediction than the line in Example 2?arrow_forwardWhat does the y -intercept on the graph of a logistic equation correspond to for a population modeled by that equation?arrow_forward
- Does Table 1 represent a linear function? If so, finda linear equation that models the data.arrow_forwardThe regional transit authority for a major metropolitan area wants to determine whetherthere is a relationship between the age of a bus and the annual maintenance cost. A sampleof ten buses resulted in the following data: a. Develop a scatter chart for these data. What does the scatter chart indicate about therelationship between age of a bus and the annual maintenance cost?b. Use the data to develop an estimated regression equation that could be used to predictthe annual maintenance cost given the age of the bus. What is the estimated regressionmodel?c. Test whether each of the regression parameters b0 and b1 is equal to zero at a 0.05level of significance. What are the correct interpretations of the estimated regressionparameters? Are these interpretations reasonable?d. How much of the variation in the sample values of annual maintenance cost does themodel you estimated in part b explain?e. What do you predict the annual maintenance cost to be for a 3.5-year-old bus?arrow_forwardRefer to the Buena School bus data. Develop a regression equation that expresses the relationship between age of the bus and maintenance. The age of the bus is the inde- pendent variable. Draw a scatter diagram. What does this diagram suggest as to the relationship between the two variables? Is it direct or indirect? Does it appear to be strong or weak? Develop a regression equation. How much does an additional year add to the main- tenance cost. What is the estimated maintenance cost for a 10-year-old bus? Conduct a test of hypothesis to determine whether the slope of the regression line is greater than zero. Use the .05 significance level. Interpret your findings from parts (a), (b), and (c) in a brief report.arrow_forward
- A. Write the equation of the regression line. B. Interpret the slope in this context, and calculate the predicted birth weight of first borns and others C. Is there a statistically significant relationship between the average birth weight and parity? Provide an explanation with numeric support.arrow_forwardConnie Harris, who is in charge of office supplies at First Capital Mortgage Corp., would like to predict the quantity of paper used in the office photocopying machines per month. She believes that the number of loans originated in a month influence the volume of photocopying performed. She has compiled the following recent monthly data: Develop an estimated regression equation for this data set. Place output in cell G1 At .05 level of significance, is the equation significant? Why? Does the equation seem to have a good “fit”? Why? Forecast the amount of paper used in a month when 42 loan originations are expected. Loans Originated in Month Paper Used (000's) 45 22 25 13 50 24 60 25 40 21 25 16 35 18 40 25arrow_forwardBill wants to explore factors affecting work stress. He would like to examine the relationship between age, number of years at the workplace, perceived social support, and work stress. He collects data on the variables from 100 employees (males and females) working in banks. Conduct a multiple regression analysis to answer the following questions: What is the relationship of age, number of years, and social support with work stress? Is the regression significant? If yes, what does it indicate? What is the regression equation for all the predictors? Write a results section based on your analysis that answers the research question. * last person got this wrong*arrow_forward
- The following data show the daily closing prices (in dollars per share) for a stock. Define the independent variable Period, where Period = 1 corresponds to the data for November 3, Period = 2 corresponds to the data for November 4, and so on. Develop the estimated regression equation that can be used to predict the closing price given the value of the Period. At the .05 level of significance, test for any positive autocorrelation in the data. Date Price ($) Nov. 3 82.87 Nov. 4 83.00 Nov. 7 83.61 Nov. 8 83.15 Nov. 9 82.84 Nov. 10 83.99 Nov. 11 84.55 Nov. 14 84.36 Nov. 15 85.53 Nov. 16 86.54 Nov. 17 86.89 Nov. 18 87.77 Nov. 21 87.29 Nov. 22 87.99 Nov. 23 88.80 Nov. 25 88.80 Nov. 28 89.11 Nov. 29 89.10 Nov. 30 88.90 Dec. 1 89.21arrow_forwardThe follow table gives the approximate economic value associated with various levels of oil recovery in Texas. Find the regression line, and use it to estimate the economic value associated with a recovery level of 70%.arrow_forwardThe St. Lucian Government is interested in predicting the number of weekly riders on the public buses using the following variables: • • • • Price of bus trips per weekThe population in the cityThe monthly income of ridersAverage rate to park your personal vehicle Determine the multiple regression equation for the data. What is the predicted value of the number of weekly riders if: price of bus trips per week = $24; population = $2,000,000; the monthly income of riders = $13,500; and average rate to park your personal vehicle = $150. Interpret the coefficient of determination.arrow_forward
- College AlgebraAlgebraISBN:9781305115545Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage LearningAlgebra and Trigonometry (MindTap Course List)AlgebraISBN:9781305071742Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage LearningGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw Hill
- Functions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage LearningBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin Harcourt