Assignment7-320-MultipleRegression2
docx
keyboard_arrow_up
School
The University of Tennessee, Knoxville *
*We aren’t endorsed by this school
Course
320
Subject
Statistics
Date
Apr 3, 2024
Type
docx
Pages
19
Uploaded by ProfessorSquidMaster853
BAS 320 - Assignment 7 - Multiple Regression Part 2
Matthew Zook
M <-
lm
(review_taste ~
., data=
BEER)
drop1
(M,
test=
"F"
)
## Single term deletions
## ## Model:
## review_taste ~ ABV + Min.IBU + Max.IBU + Astringency + Body + ## Alcohol + Bitter + Sweet + Sour + Salty + Fruits + Hoppy + ## Spices + Malty
## Df Sum of Sq RSS AIC F value Pr(>F) ## <none> 173.65 -1720.7 ## ABV 1 6.8645 180.52 -1683.9 38.9373 6.491e-10 ***
## Min.IBU 1 2.2290 175.88 -1710.0 12.6432 0.0003948 ***
## Max.IBU 1 0.0203 173.67 -1722.6 0.1150 0.7346205 ## Astringency 1 0.0989 173.75 -1722.1 0.5610 0.4540476 ## Body 1 2.1096 175.76 -1710.6 11.9661 0.0005650 ***
## Alcohol 1 3.1541 176.81 -1704.7 17.8911 2.557e-05 ***
## Bitter 1 0.1093 173.76 -1722.1 0.6200 0.4312491 ## Sweet 1 0.7480 174.40 -1718.4 4.2429 0.0396778 * ## Sour 1 1.8677 175.52 -1712.0 10.5939 0.0011732 ** ## Salty 1 0.3135 173.97 -1720.9 1.7784 0.1826602 ## Fruits 1 2.6961 176.35 -1707.3 15.2932 9.836e-05 ***
## Hoppy 1 0.2920 173.94 -1721.0 1.6563 0.1984012 ## Spices 1 6.5096 180.16 -1685.9 36.9244 1.753e-09 ***
## Malty 1 1.6272 175.28 -1713.4 9.2299 0.0024437 ** ## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
BEER <-
BEER[,
c
(
"review_taste"
,
"ABV"
,
"Fruits"
,
"Sour"
,
"Spices"
,
"Body"
)]
The following gives a skeleton/template that you could fill out to narrate your analysis. You’ll need to adapt words throughout to make it specific to your dataset. Feel free to deviate from the skeleton as long as you’re hitting all the required points (see Rubric and detailed list of requirements on Canvas)! Delete this paragraph (and any prompts provided in other paragraphs) before knitting/submitting. The document should flow nicely as a professional report.
Analysis of … (replace the … with what you’re analyzing)
I’m using a multiple regression model to predict the overall taste review from the alcohol by volume, amount of fruits, sour score, amount of spices, and body score.
I’m making this model because I am curious to see which score has the most effect on the overall score and see how well I can predict the overall score based on the predictors.
The data I’m using comes from https://www.kaggle.com/datasets/ruthgn/beer-profile-
and-ratings-data-set/ and contains a total of 1000 rows and 6 total predictors
Investigation of the relationship between the taste score and spices
I am investigating the relationship between the overall taste score the amounnt of spices in the beer. a polynomial would do a better job than the linear regression model. I would suggest the fourth order polynomial predicting the value from the spices, spices^2, Spices^3, and the Spices^4 for this.
M <-
lm
(review_taste ~
Spices, data=
BEER); choose_order
(M) ## order R2adj AICc
## 1 1 0.06564513 1417.809
## 2 2 0.14089397 1336.854
## 3 3 0.16009325 1317.265
## 4 4 0.18179266 1294.106
## 5 5 0.18539948 1292.707
## 6 6 0.18551032 1295.592
#choose_order(M)
Multiple regression model and checking of assumptions
While there are some violations, they are not enough to cause us to ditch the model. from a statistical standpoint, we have linearity in body, fruits, and sour, but we failed equal spread and normality, but they are relatively small.
M <-
lm
(review_taste ~
., data=
BEER) #Fit a multiple linear regression
summary
(M)
## ## Call:
## lm(formula = review_taste ~ ., data = BEER)
## ## Residuals:
## Min 1Q Median 3Q Max ## -2.24191 -0.20268 0.04858 0.25555 1.48930 ## ## Coefficients:
## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 2.9959999 0.0413381 72.475 < 2e-16 ***
## ABV 0.0362721 0.0052925 6.853 1.26e-11 ***
## Fruits 0.0022523 0.0007679 2.933 0.00343 ** ## Sour 0.0018520 0.0006586 2.812 0.00502 ** ## Spices 0.0028970 0.0006593 4.394 1.23e-05 ***
## Body 0.0059590 0.0005607 10.628 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## ## Residual standard error: 0.4326 on 994 degrees of freedom
## Multiple R-squared: 0.2743, Adjusted R-squared: 0.2706 ## F-statistic: 75.13 on 5 and 994 DF, p-value: < 2.2e-16
check_regression
(M,
extra=
TRUE
)
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
## ## Tests of Assumptions: ( sample size n = 1000 ):
## Linearity
## p-value for ABV : 0 ## p-value for Fruits : 0.9313 ## p-value for Sour : 0.9122 ## p-value for Spices : 0 ## p-value for Body : 0.2418 ## p-value for overall model : NA (not enough duplicate rows)
## Equal Spread: p-value is 0 ## Normality: p-value is 0 ## ## Advice: if n<25 then all tests must be passed.
## If n >= 25 and test is failed, refer to diagnostic plot to see if violation is severe
## or is small enough to be ignored.
## ## Press [enter] to continue to Predictor vs. Residuals plots or q (then Return) to quit ( 5 plots to show )
Identification of influential points
There are 7 influential points. The first influential point is unusual because it has a 0 in fruits, Sour, and Spices yet has a very high review score.
#M <- lm() #Fit a multiple regression model
influence_plot
(M)
## $Leverage
## [1] 210 477 523 690 824 885 907
influential.rows <-
influence_plot
(M)
$
Leverage
INFLUENCE <-
data.frame
(
matrix
(
0
,
nrow=
length
(influential.rows),
ncol=
ncol
(BEER)) )
names
(INFLUENCE) <-
names
(BEER)
for
(r in
1
:
length
(influential.rows)) {
x <-
as.numeric
(BEER[influential.rows[r],])
INFLUENCE[r,] <-
sapply
(
1
:
length
(x),
function
(i) { mean
(BEER[[i]] <=
x[i])} )
}
round
(INFLUENCE,
digits=
2
)
## review_taste ABV Fruits Sour Spices Body
## 1 0.99 0.98 0.02 0.01 0.05 0.02
## 2 0.14 0.57 0.99 0.98 0.32 0.68
## 3 0.10 0.07 0.47 0.52 1.00 0.60
## 4 0.28 1.00 0.23 0.14 0.15 0.03
## 5 0.11 0.33 0.06 0.05 0.98 0.94
## 6 0.08 0.56 0.40 0.56 1.00 0.14
## 7 0.06 0.11 0.98 0.46 0.98 0.21
BEER[
210
,]
## review_taste ABV Fruits Sour Spices Body
## 3098 4.5 12 0 0 0 4
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
summary
(BEER)
## review_taste ABV Fruits Sour ## Min. :1.439 Min. : 0.000 Min. : 0.00 Min. : 0.00 ## 1st Qu.:3.495 1st Qu.: 5.000 1st Qu.: 12.00 1st Qu.: 11.00 ## Median :3.787 Median : 5.900 Median : 28.00 Median : 22.00 ## Mean :3.697 Mean : 6.502 Mean : 37.31 Mean : 33.09 ## 3rd Qu.:4.028 3rd Qu.: 7.500 3rd Qu.: 56.00 3rd Qu.: 42.00 ## Max. :4.923 Max. :57.500 Max. :148.00 Max. :241.00 ## Spices Body ## Min. : 0.00 Min. : 0.00 ## 1st Qu.: 4.00 1st Qu.: 29.00 ## Median : 10.00 Median : 39.00 ## Mean : 17.41 Mean : 45.17 ## 3rd Qu.: 22.25 3rd Qu.: 57.25 ## Max. :170.00 Max. :175.00
Investigation of an interaction between ____ and ____ (replace with what you’re looking at)
Review = 3.505842 - 0.177307*Spices
Review = 3.70636 -0.355975*Spices
at small spice scores, bodies which have higher scores get larger taste scores, but as we go up in spice score, beers with a high body gets lower scores than ones with higher
#You'll need to run the next 2 lines, but not include them in your report
#M <- lm( y ~ .^2, data=DATA) # do all 2 way interactions
#see_interactions(M,cex=0.6,pos="topleft",many=TRUE)
#This is what we want to see in your report
#M <- lm( y ~ x1*x2, data=DATA)
#summary(DATA$x1)
#visualize_model(M)
M2 <-
lm
(review_taste ~
.
^
2
, data=
BEER)
see_interactions
(M2,
many=
TRUE
)
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
## ## Press [enter] to continue to see next set of interactions or q (then Enter) to quit
#Body and Sour
#Body and Spices
M.int <-
lm
(review_taste ~
Body
*
Spices,
data=
BEER)
visualize_model
(M.int)
## ## Interaction term has p-value 0.005983
summary
(M.int)
## ## Call:
## lm(formula = review_taste ~ Body * Spices, data = BEER)
## ## Residuals:
## Min 1Q Median 3Q Max ## -2.17116 -0.21003 0.03287 0.28643 1.48974 ## ## Coefficients:
## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 3.300e+00 3.614e-02 91.322 < 2e-16 ***
## Body 7.098e-03 7.275e-04 9.756 < 2e-16 ***
## Spices 7.742e-03 1.428e-03 5.423 7.35e-08 ***
## Body:Spices -6.381e-05 2.316e-05 -2.755 0.00598 ** ## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## ## Residual standard error: 0.4655 on 996 degrees of freedom
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
## Multiple R-squared: 0.158, Adjusted R-squared: 0.1555 ## F-statistic: 62.3 on 3 and 996 DF, p-value: < 2.2e-16
summary
(BEER
$
Body)
## Min. 1st Qu. Median Mean 3rd Qu. Max. ## 0.00 29.00 39.00 45.17 57.25 175.00
Related Documents
Related Questions
Mental development in humans is related to the
volume of the part of the brain known as the
hippocampus. The given regression output shows the
mental development index at age 24 months vs. the
hippocampus volume in ml at birth for a
representative sample of 17 premature infants.
MDI_24 By Vol(ml)
2.5
2.4-
2.3-
2.2-
2.1-
2-
1.9-
1.8-
1.7-
1.6-
1.5-
50
60
70
80
90
100
110
12
Vol(ml)
Regression Analysis
MDI_24MO = 1.1359094 + 0.0093475*HippoVol
Summary of Fit
RSquare
RSquare Adj
S
Mean of Response
0.265
0.216
0.223
1.97758
NObservations
17
Analysis of Variance
Source
Model
DF Sum of Squares
1 0.268
Mean Square F Ratio
0.268
5.4023
MDI_24M0
arrow_forward
Define regression line
arrow_forward
Find the slope of regression line, y-intercept of regression line, coefficient of determination (r^2), and the linear correlation coefficient (r)
arrow_forward
A regression was run to determine if there is a relationship between the happiness index (y) and lifeexpectancy in years of a given country (x). The results of the regression were: y^=a+bx ; a=-0.423 ,b=0.07
a. Write the equation of the Least Squares Regression line.b. Find the value for the correlation coefficient, r?c. If a country increases its life expectancy, the happiness index will Increase or decrease ( circleone)d. If the life expectancy is increased by 1 year in a certain country, how much will the happinessindex change? Round to two decimal places.e. Use the regression line to predict the happiness index of a country with a life expectancy of 85years. Round to two decimal places.-
arrow_forward
Using SAS, draw a scatterplot between variables CRIME_RATE and PROP_CHANGE_INCOME. Attach the
scatterplot. Are those two variables good candidates to be analyzed using linear regression? Explain why or why
not.
crime_rate
150
100
50
O
15
O
O
20
O
O
O
25
O
8
O
O
8
O
O
o
O
prop_change_income
O
O
O
30
O
O
O
35
O
O
O
40
arrow_forward
Compute the least-squares regression line for predicting y from x given the following summary statistics. Round the slope and y-intercept to at least four decimal places.
x = 12.5
sx = 2.2
y = 1400
sy = 1.8
r = 0.50
Regression line equation: y^ = ___.
image attached bellow for better view.
arrow_forward
Use the least squares regression line of this data set to predict a value.
Meteorologists in a seaside town wanted to understand how their annual rainfall
is affected by the temperature of coastal waters.
For the past few years, they monitored the average temperature of coastal
waters (in Celsius), x, as well as the annual rainfall (in millimetres), y.
Rainfall statistics
• The mean of the x-values is 11.503.
• The mean of the y-values is 366.637.
• The sample standard deviation of the x-values is 4.900.
• The sample standard deviation of the y-values is 44.387.
• The correlation coefficient of the data set is 0.896.
The least squares regression line of this data set is:
y = 8.116x + 273.273
How much rainfall does this line predict in a year if the average temperature of coastal waters
is 15 degrees Celsius?
Round your answer to the nearest integer.
millimetres
arrow_forward
For a regression line y^=mx+b, what does the Sum of Squares Due to Error measure?
Select the correct answer below:
the total variation in x that cannot be explained by variation in y
the total variation in y that cannot be explained by the error, or residuals, for x
the total variation in y that cannot be explained by the variation in x
the total variation in x that cannot be explained by the error, or residuals, for y
arrow_forward
how to perform a linear regression to test the relationship between an independent and dependent variable where the data consists of 5 groups with repeated measures?
arrow_forward
Study conductes in patients with HIV. Primary outcome is CD4 coubt(measure stage of disease). Lower CD4=more advanced disease. Wqnt to find association between taking vitamins and supplements and CD4 count. Multiple regression analysis done relating CD4 to use of supplements (1=yes 0=no) and to duration of HIV in years. (# of yeara between diagnosis and study date) y=CD4 Count
Y=501.41+12.67 supplements -30 23 duration of HIV.
A. What is ezpected CD4 count for patients taking supplements who had HIv for 2.5 years?
B. Expected CD4 count not takong supplements with HIv at study start date?
C. Expected CD4 coubt for patients not taking supplements with HIV for 2.5 years?
D. Uf compare 2 patients, 1 HIV for 5 years longer than other, whats expected difference in CD4 count?
arrow_forward
please anwer whatever you are allowed too. Thank you.
A regression was run to determine if there is a relationship between the happiness index (y) and life expectancy in years of a given country (x).The results of the regression were:
ˆyy^=a+bxa=-1.68b=0.168
(a) Write the equation of the Least Squares Regression line of the formˆyy^= + x(b) Which is a possible value for the correlation coefficient, rr?
-1.417
1.417
0.702
-0.702
(c) If a country increases its life expectancy, the happiness index will
increase
decrease
(d) If the life expectancy is increased by 0.5 years in a certain country, how much will the happiness index change? Round to two decimal places.(e) Use the regression line to predict the happiness index of a country with a life expectancy of 69 years. Round to two decimal places.
arrow_forward
Pre-study scores versus post-study scores for a class of 120 college freshman English students were considered. The residual plot for the least squares regression line showed no pattern. The least squares regression line was \hat{y} = 0.2 + 0.9xy^=0.2+0.9x with a correlation coefficient r = 0.76. What percent of the variation of post-study scores can be explained by the variation in pre-study scores?
57.8%
87.2%
52.0%
76.0%
We cannot determine the answer using the information given.
arrow_forward
Please answer all parts of the question, thank you
arrow_forward
Managers rate employees according to job performance and attitude. The results for several randomly selected employees are given below.
Performance
59
63
65
69
58
77
76
69
70
64
Attitude
72
67
78
82
75
87
92
83
87
78
Use the given data to find the equation of the regression line.
y^ = - 47.3 + 2.02x
y^ = 2.81 + 1.35x
y^ = 11.7 + 1.02x
y^ = 92.3 - 0.669x
arrow_forward
a. what is the equation of the regression line?
b. interpret r of the regression line
c. interpret r^2 of the regression line
L1 - 4,15,12,11,8,6,7,2,7,14,20,3,13
L2 - 120,200,140,110,120,80,190,100,120,190,190,110,120
arrow_forward
^
The money raised and spent (both in millions of dollars) by all congressional campaigns for 8 recent 2-year periods are
shown in the table. The equation of the regression line is y = 0.951x + 17.469. Find the coefficient of determination and
interpret the result.
Money raised, x
471.6
653.1
734.2
786.7
783.8
1044.8 954.2 1216.7
Money spent, y
436.2
689.2
729.3
766.4
726.7
1019.7 925.7 1163.9
Find the coefficient of determination and interpret the result.
2 = 1
(Round to three decimal places as needed.)
arrow_forward
A multiple regression model has the form
y^=b0+b1x1+b2x2
The coefficient b1 is interpreted as the:A. change in y per unit change in x1, holding x2 constantB. change in the average value of y per unit change in x1, holding x2 constantC. change in y per unit change in x1, when x1 and x2 values are correlatedD. change in y per unit change in x1
If multicollinearity exists among the independent variables included in a multiple regression model, the:A. multiple coefficient of determination will assume a value close to zeroB. standard errors of the regression coefficients for the correlated independent variables will increaseC. regression coefficients will be difficult to interpretD. regression coefficients will be difficult to interpret and the standard errors of the regression coefficients for the correlated independent variables will increase
please explain which answer is correct and why
arrow_forward
call:
Researchers measured the percent
1m(formula = Symptoms - wear_mask, data - some_states)
of people in 25 states who ʻknew
someone with COVID symptoms' (ŷ)
and regressed this on the
percent of the population frequently
wearing a mask in public (x).
Residuals:
Min
-7.9167 -2.3306 -0.2469 2.5020 7. 3345
10 Median
30
Маx
coefficients:
(Intercept) 111.0981
wear_mask
Estimate std. Error t value Pr (>|t|)
10. 5423 10. 538 2.82e-10 ***
-8. 375 1. 94 e-08 ***
-1.0419
0.1244
signif. codes: 0 ****' 0.001 ***' 0.01 **' 0.05 '.' 0.1 ' ' 1
Residual standard error: 3.859 on 23 degrees of freedom
Multiple R-squared: 0.7531,
F-statistic: 70.15 on 1 and 23 DF, p-value: 1.936e-08
Adjusted R-squared: 0.7423
If 75 percent of people in a state wear masks regularly, what % of people does this
model predict will know someone with COVID symptons?
1) 32
2) 33
3) 34
4) 35
arrow_forward
Benign prostatic hyperplasia is a noncancerous
enlargement of the prostate gland that adversely
affects the quality of life (QoL) of millions of men. A
study of minimally invasive procedures for the
treatment for this condition looked at pretreatment
QoL (qol_base) and quality of life after 3 month on
treatment (qol_3mo)
The baseline data for 10 patients and their 3 month
follow-up data is presented below:
MAXFLO_B = maximum urine flow at baseline (urine
flow measurement scale misplaced)
MAXFLO3M = maximum urine flow after 3 months of
treatment
maxflo_b
maxflo3m
7
8
18
8
13
9.
16
11
8
4
12
10
8
14
10
13
arrow_forward
Does the Regression line give information about all the data points in the data set? Does the Regression line usually have all the points in the data set on it?
arrow_forward
Find the best predicted value of y corresponding to the given value of x.
^
21) Six pairs of data yield r = 0.789 and the regression equation y = 4x - 2. Also, y = 19.0. What is the
best predicted value of y for x = 5?
A) 18.0
B) 18.5
C) 19.0
D) 22.0
21)
arrow_forward
true or false. Explain
In a fixed effects regression, the residuals (ûit) will be uncorrelated with the estimated fixed effects.
arrow_forward
A regression was run to determine if there is a relationship between hours a week of study (x) and the test scores (y).
The results of the regression were:
y = ax + b
a = 6.225
b = 37.66
r^2 = 0.531441
r = 0.729
Predict the final exam score of a student studies 10.5 hours per week.
Round to a whole number
arrow_forward
Show calculations please
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you

MATLAB: An Introduction with Applications
Statistics
ISBN:9781119256830
Author:Amos Gilat
Publisher:John Wiley & Sons Inc

Probability and Statistics for Engineering and th...
Statistics
ISBN:9781305251809
Author:Jay L. Devore
Publisher:Cengage Learning

Statistics for The Behavioral Sciences (MindTap C...
Statistics
ISBN:9781305504912
Author:Frederick J Gravetter, Larry B. Wallnau
Publisher:Cengage Learning

Elementary Statistics: Picturing the World (7th E...
Statistics
ISBN:9780134683416
Author:Ron Larson, Betsy Farber
Publisher:PEARSON

The Basic Practice of Statistics
Statistics
ISBN:9781319042578
Author:David S. Moore, William I. Notz, Michael A. Fligner
Publisher:W. H. Freeman

Introduction to the Practice of Statistics
Statistics
ISBN:9781319013387
Author:David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:W. H. Freeman
Related Questions
- Mental development in humans is related to the volume of the part of the brain known as the hippocampus. The given regression output shows the mental development index at age 24 months vs. the hippocampus volume in ml at birth for a representative sample of 17 premature infants. MDI_24 By Vol(ml) 2.5 2.4- 2.3- 2.2- 2.1- 2- 1.9- 1.8- 1.7- 1.6- 1.5- 50 60 70 80 90 100 110 12 Vol(ml) Regression Analysis MDI_24MO = 1.1359094 + 0.0093475*HippoVol Summary of Fit RSquare RSquare Adj S Mean of Response 0.265 0.216 0.223 1.97758 NObservations 17 Analysis of Variance Source Model DF Sum of Squares 1 0.268 Mean Square F Ratio 0.268 5.4023 MDI_24M0arrow_forwardDefine regression linearrow_forwardFind the slope of regression line, y-intercept of regression line, coefficient of determination (r^2), and the linear correlation coefficient (r)arrow_forward
- A regression was run to determine if there is a relationship between the happiness index (y) and lifeexpectancy in years of a given country (x). The results of the regression were: y^=a+bx ; a=-0.423 ,b=0.07 a. Write the equation of the Least Squares Regression line.b. Find the value for the correlation coefficient, r?c. If a country increases its life expectancy, the happiness index will Increase or decrease ( circleone)d. If the life expectancy is increased by 1 year in a certain country, how much will the happinessindex change? Round to two decimal places.e. Use the regression line to predict the happiness index of a country with a life expectancy of 85years. Round to two decimal places.-arrow_forwardUsing SAS, draw a scatterplot between variables CRIME_RATE and PROP_CHANGE_INCOME. Attach the scatterplot. Are those two variables good candidates to be analyzed using linear regression? Explain why or why not. crime_rate 150 100 50 O 15 O O 20 O O O 25 O 8 O O 8 O O o O prop_change_income O O O 30 O O O 35 O O O 40arrow_forwardCompute the least-squares regression line for predicting y from x given the following summary statistics. Round the slope and y-intercept to at least four decimal places. x = 12.5 sx = 2.2 y = 1400 sy = 1.8 r = 0.50 Regression line equation: y^ = ___. image attached bellow for better view.arrow_forward
- Use the least squares regression line of this data set to predict a value. Meteorologists in a seaside town wanted to understand how their annual rainfall is affected by the temperature of coastal waters. For the past few years, they monitored the average temperature of coastal waters (in Celsius), x, as well as the annual rainfall (in millimetres), y. Rainfall statistics • The mean of the x-values is 11.503. • The mean of the y-values is 366.637. • The sample standard deviation of the x-values is 4.900. • The sample standard deviation of the y-values is 44.387. • The correlation coefficient of the data set is 0.896. The least squares regression line of this data set is: y = 8.116x + 273.273 How much rainfall does this line predict in a year if the average temperature of coastal waters is 15 degrees Celsius? Round your answer to the nearest integer. millimetresarrow_forwardFor a regression line y^=mx+b, what does the Sum of Squares Due to Error measure? Select the correct answer below: the total variation in x that cannot be explained by variation in y the total variation in y that cannot be explained by the error, or residuals, for x the total variation in y that cannot be explained by the variation in x the total variation in x that cannot be explained by the error, or residuals, for yarrow_forwardhow to perform a linear regression to test the relationship between an independent and dependent variable where the data consists of 5 groups with repeated measures?arrow_forward
- Study conductes in patients with HIV. Primary outcome is CD4 coubt(measure stage of disease). Lower CD4=more advanced disease. Wqnt to find association between taking vitamins and supplements and CD4 count. Multiple regression analysis done relating CD4 to use of supplements (1=yes 0=no) and to duration of HIV in years. (# of yeara between diagnosis and study date) y=CD4 Count Y=501.41+12.67 supplements -30 23 duration of HIV. A. What is ezpected CD4 count for patients taking supplements who had HIv for 2.5 years? B. Expected CD4 count not takong supplements with HIv at study start date? C. Expected CD4 coubt for patients not taking supplements with HIV for 2.5 years? D. Uf compare 2 patients, 1 HIV for 5 years longer than other, whats expected difference in CD4 count?arrow_forwardplease anwer whatever you are allowed too. Thank you. A regression was run to determine if there is a relationship between the happiness index (y) and life expectancy in years of a given country (x).The results of the regression were: ˆyy^=a+bxa=-1.68b=0.168 (a) Write the equation of the Least Squares Regression line of the formˆyy^= + x(b) Which is a possible value for the correlation coefficient, rr? -1.417 1.417 0.702 -0.702 (c) If a country increases its life expectancy, the happiness index will increase decrease (d) If the life expectancy is increased by 0.5 years in a certain country, how much will the happiness index change? Round to two decimal places.(e) Use the regression line to predict the happiness index of a country with a life expectancy of 69 years. Round to two decimal places.arrow_forwardPre-study scores versus post-study scores for a class of 120 college freshman English students were considered. The residual plot for the least squares regression line showed no pattern. The least squares regression line was \hat{y} = 0.2 + 0.9xy^=0.2+0.9x with a correlation coefficient r = 0.76. What percent of the variation of post-study scores can be explained by the variation in pre-study scores? 57.8% 87.2% 52.0% 76.0% We cannot determine the answer using the information given.arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman

MATLAB: An Introduction with Applications
Statistics
ISBN:9781119256830
Author:Amos Gilat
Publisher:John Wiley & Sons Inc

Probability and Statistics for Engineering and th...
Statistics
ISBN:9781305251809
Author:Jay L. Devore
Publisher:Cengage Learning

Statistics for The Behavioral Sciences (MindTap C...
Statistics
ISBN:9781305504912
Author:Frederick J Gravetter, Larry B. Wallnau
Publisher:Cengage Learning

Elementary Statistics: Picturing the World (7th E...
Statistics
ISBN:9780134683416
Author:Ron Larson, Betsy Farber
Publisher:PEARSON

The Basic Practice of Statistics
Statistics
ISBN:9781319042578
Author:David S. Moore, William I. Notz, Michael A. Fligner
Publisher:W. H. Freeman

Introduction to the Practice of Statistics
Statistics
ISBN:9781319013387
Author:David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:W. H. Freeman