(a) Comment on the suitability of this data set for simple linear regression modelling. (b) Explain the difference between an outlier and an influential observation. The data is loaded into RStudio and assigned to two vectors x and y. (c) Write R code that constructs a simple linear regression model with X as the explanatory variable and Y as the response, calculates and plots the leverage of each observation, and then formally tests whether there are any influential observations in this data with reference to an appropriate F test, assigning the value TRUE in R if there are one or more influential observations and FALSE if there are none. (d) What should the modeler do if the code in (c) above returns TRUE?


Note: The answer should be typed 

 

Question 2
Some data is collected with 12 observations of two variables X and Y and the following
scatter plot of the data is produced.
[Figure: scatter plot titled "Plot of Y versus X", with Xi on the horizontal axis and Yi on the vertical axis; not reproduced here.]
(a) Comment on the suitability of this data set for simple linear regression modelling.
(b) Explain the difference between an outlier and an influential observation.
The data is loaded into RStudio and assigned to two vectors x and y.
(c) Write R code that constructs a simple linear regression model with X as the explanatory
variable and Y as the response, calculates and plots the leverage of each observation
and then formally tests whether there are any influential observations in this data with
reference to an appropriate F test assigning the value TRUE in R if there is one or more
influential observations and FALSE if there are no influential observations.
(d) What should the modeler do if the code in (c) above returns TRUE?
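
For part (c), one possible approach is sketched below. It is a rough outline under stated assumptions, not a model answer. The actual x and y are not shown in the question, so the block generates hypothetical stand-in data with 12 observations. It also assumes that the "appropriate F test" means the common convention of comparing each Cook's distance with the median of an F distribution on p and n - p degrees of freedom; a particular course may use a different cutoff. The names fit, lev, cooks, cutoff and influential are illustrative.

```r
# Hypothetical stand-in data: the real x and y from the question are not shown.
# Replace these three lines with the vectors already loaded in the session.
set.seed(1)
x <- c(runif(11, 0, 1), 3)          # 11 clustered points plus one far out in x
y <- 2 + 3 * x + rnorm(12)

# Simple linear regression with x as explanatory variable and y as response
fit <- lm(y ~ x)

n <- length(x)                      # number of observations
p <- length(coef(fit))              # number of estimated coefficients (2 here)

# Leverage of each observation (diagonal of the hat matrix), then plot it
lev <- hatvalues(fit)
plot(lev, type = "h", xlab = "Observation", ylab = "Leverage",
     main = "Leverage of each observation")
abline(h = 2 * p / n, lty = 2)      # rule-of-thumb reference line at 2p/n

# Cook's distance for each observation
cooks <- cooks.distance(fit)

# Assumed convention: flag an observation as influential when its Cook's
# distance exceeds the median of the F(p, n - p) distribution
cutoff <- qf(0.5, df1 = p, df2 = n - p)

# TRUE if one or more observations are influential, FALSE otherwise
influential <- any(cooks > cutoff)
influential
```

With the real data, only the first three assignments need to be dropped; the rest of the block uses just base R functions from the stats and graphics packages.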