Residuals ° 90 Fitted values Im hardness-.) 100 110 Residuals vs Fitted Figure 1: 15 1.0 05 Standardized residuals -15 -10 -0.5 .0 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 Theoretical Quantles Im(hardness -.) Normal Q-Q Statistical Modelling Exam 25/01/2024 Exercise 1 The data contained in the cement dataset represent the hardness (hardness variable) of 13 types of cement with different chemical compositions. Specifically, each type is obtained with varying proportions of aluminium (aluminium variable), silicate (silicate variable), calcium aluminoferrite (aluminium ferrite), and silicate bic (silicate_bic). The interest is explaining how the hardness of cement depends on the proportions of chemicals. A regression model was fitted for this purpose and produced the following result: Estimate Std. Error t statistic Pr(>|t|) (Intercept) 124.4809 26.7557 4.653 0.0016 aluminium 0.9739 ?? 3.435 0.0089 silicate -0.1405 0.2891 -0.486 0.6400 aluminium ferrite -0.4974 0.2751 ?? ?? silicate_bic ?? 0.3214 -2.481 0.0381 Error sum of squares 49.378 Total sum of squares R² coefficient 2715.763 ?? a) Write the model formulation and assumptions. b) Complete the missing values in the table. For "Pr(> |t)" of aluminium ferrite provide an approximate value. What variables have a statistically significant effect? c) Test the statistical hypothesis corresponding to the statement "the covariates do not have an effect on the hardness of cement". d) On a reduced model ("model B") that includes only the variables aluminium and silicate_bic the error sum of squares is equal to SSEB = 74.762. Perform an F test to compare this model with the complete model ("model A") that includes all the covariates. Interpret the result: which model would you prefer? e) Obtain the coefficient R² of model B. Instead of performing the test in point (d), could you have simply compared the coefficient R² of the two models? Why? f) Figure 1 shows two plots regarding the complete model (model A). Explain what they represent and interpret them.

MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
icon
Related questions
Question

For context, the images attached below (the question and the related figure) is from a january 2024 past paper

Residuals
°
90
Fitted values
Im hardness-.)
100
110
Residuals vs Fitted
Figure 1:
15
1.0
05
Standardized residuals
-15
-10
-0.5
.0
-1.5
-1.0
-0.5 0.0
0.5
1.0
1.5
Theoretical Quantles
Im(hardness -.)
Normal Q-Q
Transcribed Image Text:Residuals ° 90 Fitted values Im hardness-.) 100 110 Residuals vs Fitted Figure 1: 15 1.0 05 Standardized residuals -15 -10 -0.5 .0 -1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 Theoretical Quantles Im(hardness -.) Normal Q-Q
Statistical Modelling
Exam 25/01/2024
Exercise 1
The data contained in the cement dataset represent the hardness (hardness variable) of 13
types of cement with different chemical compositions. Specifically, each type is obtained with
varying proportions of aluminium (aluminium variable), silicate (silicate variable), calcium
aluminoferrite (aluminium ferrite), and silicate bic (silicate_bic). The interest is explaining
how the hardness of cement depends on the proportions of chemicals.
A regression model was fitted for this purpose and produced the following result:
Estimate Std. Error t statistic Pr(>|t|)
(Intercept)
124.4809
26.7557
4.653
0.0016
aluminium
0.9739
??
3.435
0.0089
silicate
-0.1405
0.2891
-0.486
0.6400
aluminium ferrite
-0.4974
0.2751
??
??
silicate_bic
??
0.3214
-2.481
0.0381
Error sum of squares
49.378
Total sum of squares
R² coefficient
2715.763
??
a) Write the model formulation and assumptions.
b) Complete the missing values in the table. For "Pr(> |t)" of aluminium ferrite provide
an approximate value. What variables have a statistically significant effect?
c) Test the statistical hypothesis corresponding to the statement "the covariates do not have
an effect on the hardness of cement".
d) On a reduced model ("model B") that includes only the variables aluminium and silicate_bic
the error sum of squares is equal to SSEB = 74.762. Perform an F test to compare this
model with the complete model ("model A") that includes all the covariates. Interpret the
result: which model would you prefer?
e) Obtain the coefficient R² of model B. Instead of performing the test in point (d), could
you have simply compared the coefficient R² of the two models? Why?
f) Figure 1 shows two plots regarding the complete model (model A). Explain what they
represent and interpret them.
Transcribed Image Text:Statistical Modelling Exam 25/01/2024 Exercise 1 The data contained in the cement dataset represent the hardness (hardness variable) of 13 types of cement with different chemical compositions. Specifically, each type is obtained with varying proportions of aluminium (aluminium variable), silicate (silicate variable), calcium aluminoferrite (aluminium ferrite), and silicate bic (silicate_bic). The interest is explaining how the hardness of cement depends on the proportions of chemicals. A regression model was fitted for this purpose and produced the following result: Estimate Std. Error t statistic Pr(>|t|) (Intercept) 124.4809 26.7557 4.653 0.0016 aluminium 0.9739 ?? 3.435 0.0089 silicate -0.1405 0.2891 -0.486 0.6400 aluminium ferrite -0.4974 0.2751 ?? ?? silicate_bic ?? 0.3214 -2.481 0.0381 Error sum of squares 49.378 Total sum of squares R² coefficient 2715.763 ?? a) Write the model formulation and assumptions. b) Complete the missing values in the table. For "Pr(> |t)" of aluminium ferrite provide an approximate value. What variables have a statistically significant effect? c) Test the statistical hypothesis corresponding to the statement "the covariates do not have an effect on the hardness of cement". d) On a reduced model ("model B") that includes only the variables aluminium and silicate_bic the error sum of squares is equal to SSEB = 74.762. Perform an F test to compare this model with the complete model ("model A") that includes all the covariates. Interpret the result: which model would you prefer? e) Obtain the coefficient R² of model B. Instead of performing the test in point (d), could you have simply compared the coefficient R² of the two models? Why? f) Figure 1 shows two plots regarding the complete model (model A). Explain what they represent and interpret them.
Expert Solution
steps

Step by step

Solved in 2 steps

Blurred answer
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
MATLAB: An Introduction with Applications
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman