HW4_STAT425_Spr2024

pdf

School

University of Illinois, Urbana Champaign *

*We aren’t endorsed by this school

Course

425

Subject

Statistics

Date

Apr 3, 2024

Type

pdf

Pages

2

Uploaded by TheWongLife2

Report
STAT 425 Statistical Modeling I Spring 2024 Homework No. 4 1 Instructions Due Date: Mar 4th, 11:59 pm Homework presentation should be neat and submitted through Canvas. R codes should be sub- mitted through Canvas as well. Please use RMardown to prepare your solutions, and submit your .Rmd file (source code) and your .pdf file, after doing the knitting of the .Rmd file. Only two files should be submitted for your homework: Your .Rmd files and your .pdf file. For more information about R Markdown you can also check this link. You must show all your work for full credit. If you feel it would help, you are encouraged to work together with your class mates on the Homework, but you have to present assignments individually using your own words. The aim of the Homework is to help you learn the material and practice for the exams. Late assignments will not be accepted . Graduate students should attempt all problems. Under- graduate students can skip problems marked as GR (if any). 2 Problems 1. Problem 1 : For the salmonella data set fit a linear model with colonies as the response, and log( dose + 1) as predictor. (a) Comments on the diagnostic plot results. (b) Use an appropriate test to determine whether the model fits the data well (Hint: Check for lack of fit) 2. Problem 2 : The gammaray data set shows the x-ray decay light curve of gamma ray burst. Note that the measurement errors on the response are provided ( error ). (a) Plot the data and comment on your results. (b) Is there any transformation suggested for the response and/or the predictors? Justify your answer. (c) Build a model to predict the flux as a function of time using appropriate transformations of the response and/or predictors and appropriate weights (Hint: consider using weighted least squares). 3. Problem 3 : For the longley data set, fit a model with Employed as the response and the other variables as predictors. (a) Compute and comment on the correlation between predictors. (b) Compute and comment on the Condition number. Is there an indication of collinearity? (c) Compute and comment on the variance inflation factors.
4. Problem 4 : Use the cheddar data for this question (a) Fit a multiple linear regression model for taste as a response, with the other three variables as predictors. Is any transformation of the predictors suggested? Justify your answer. (b) Use the Box-Cox method to determine an optimal transformation of the response. Would it be reasonable to leave the response untransformed? (c) Use an appropriate transformation of the response and refit the linear model. Do these new results make any difference to the transformations suggested for the predictors in part a) (if any)? Note: All data sets are from the faraway library in R
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help