---
title: "HW1 Peer Assessment"
output:
  html_document:
    df_print: paged
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
# Part A. Variables
In the field of psychology, much research relies on self-report surveys that use Likert scales (look it up!).
### A1
__What type of variable is a Likert response?__ (1 pt)
### A2
__What are some (at least 2) benefits of using Likert scales?__ (2 pts)
### A3
__What are some drawbacks of using them? Make sure you mention at least one 'drawback' and one 'danger' (a 'drawback' is a shortcoming, while a 'danger' implies potential harm).__ (2 pts)
# Part B. Simple Linear Regression
Perform linear regression on a dataset of used-car (Toyota Corolla) sales records from a European Toyota dealer. We would like to construct a reasonable linear regression model for the relationship between the sales price of a used car and various explanatory variables (such as age, mileage, and horsepower). We are interested in which factors affect the sales price of a used car and by how much.
## Data Description

- *Id* - ID number of each used car
- *Model* - Model name of each used car
- *Price* - The price (in Euros) at which each used car was sold
- *Age* - Age (in months) of each used car as of August 2004
- *KM* - Accumulated kilometers on odometer
- *HP* - Horsepower
- *Metallic* - Metallic color? (Yes = 1, No = 0)
- *Automatic* - Automatic transmission? (Yes = 1, No = 0)
- *CC* - Cylinder volume (in cubic centimeters)
- *Doors* - Number of doors
- *Gears* - Number of gears
- *Weight* - Weight (in kilograms)

The data is in the file "UsedCars.csv". To read the data in `R`, save the file in your working directory (make sure you have changed the directory if different from the R working directory) and read the data using the `R` function `read.csv()`.
Read the data and show the first few rows.
```{r}
# Read in the data
data <- read.csv("UsedCars.csv", sep = ",", header = TRUE)
# Show the first few rows of data
head(data, 3)
```
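
If `read.csv()` fails with a "cannot open file" error, the file is most likely not in the working directory. The chunk below is a minimal sketch of a pre-flight check and is not part of the assignment template; the variable name `csv_path` is an assumption introduced here for illustration. Note that, by default, knitr evaluates chunks in the directory that contains the .Rmd file, which may differ from the working directory of your interactive session.

```{r}
# Hypothetical helper check (not part of the assignment):
# confirm that the CSV is visible from the current working directory.
csv_path <- "UsedCars.csv"  # assumed file name, taken from the text above

# Print the directory R is currently reading from
print(getwd())

# Warn (without stopping) if the file cannot be found there
if (!file.exists(csv_path)) {
  message("'", csv_path, "' not found in ", getwd(),
          "; call setwd() or pass a full path to read.csv().")
}
```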
## Question B1: Exploratory Data Analysis
a. **3 pts** Use a scatter plot to describe the relationship between *Price* and the accumulated kilometers on the odometer (*KM*). Describe the general trend (direction and form). Include the plot and the R code used.
```{r}
# Your code here...
```
b. **3 pts** What is the value of the correlation coefficient between *Price* and *KM*? Please interpret the strength of the correlation based on the correlation coefficient.
```{r}
# Your code here...
```
c. **2 pts** Based on this exploratory analysis, would you recommend a simple linear regression model for the relationship?
d. **1 pt** Based on the analysis above, would you pursue a transformation of the data? *Do not transform the data.*
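
The mechanics of a scatter plot and a correlation coefficient can be sketched on simulated data. The chunk below is only an illustration: the data frame `sim` and its columns `x` and `y` are made up for this example and are not the UsedCars data, so the numbers it produces say nothing about *Price* and *KM*.

```{r}
# Simulated data for illustration only; NOT the UsedCars data.
set.seed(1)
sim <- data.frame(x = runif(100, min = 0, max = 200))
sim$y <- 50 - 0.2 * sim$x + rnorm(100, sd = 5)  # decreasing linear trend plus noise

# Scatter plot of the response against the predictor
plot(sim$x, sim$y,
     xlab = "x (simulated predictor)",
     ylab = "y (simulated response)",
     main = "Simulated example: y versus x")

# Pearson correlation coefficient (the default method of cor())
r <- cor(sim$x, sim$y)
r
```

A value of `r` near -1 or 1 indicates a strong linear association, while a value near 0 indicates a weak one; the sign gives the direction of the trend.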
## Question B2: Fitting the Simple Linear Regression Model
Fit a linear regression model, named *model_1*, to evaluate the relationship between the used-car *Price* and the accumulated *KM*. *Do not transform the data.* The function you should use in R is:
```{r}
# Your code here...
```
a. **3 pts** What are the model parameters and what are their estimates?
b. **2 pts** Write down the estimated simple linear regression equation.
c. **2 pts** Interpret the estimated value of the $\beta_1$ parameter in the context of the problem.
d. **2 pts** Find a 95% confidence interval for the $\beta_1$ parameter. Is $\beta_1$ statistically significant at this level?
```{r}
# Your code here...
```
e. **2 pts** Is $\beta_1$ statistically significantly negative at an $\alpha$-level of 0.01? What is the approximate p-value of this test?
```{r}
# Your code here...
```
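
The steps in Question B2 can be sketched on simulated data for the model $Y = \beta_0 + \beta_1 x + \epsilon$. This is a hedged illustration, not a solution: the names `sim`, `fit`, `t_stat`, and `p_lower` are assumptions introduced here, and the data are generated rather than read from "UsedCars.csv". The one-sided p-value is computed from the lower tail of the t distribution, assuming the hypotheses $H_0: \beta_1 \ge 0$ versus $H_a: \beta_1 < 0$.

```{r}
# Simulated data for illustration only; NOT the UsedCars data.
set.seed(1)
sim <- data.frame(x = runif(100, min = 0, max = 200))
sim$y <- 50 - 0.2 * sim$x + rnorm(100, sd = 5)

# Fit y = beta_0 + beta_1 * x + error by least squares
fit <- lm(y ~ x, data = sim)

# Estimates of beta_0 (intercept) and beta_1 (slope)
coef(fit)

# Estimate of the error variance sigma^2
sigma(fit)^2

# Two-sided 95% confidence interval for the slope
confint(fit, "x", level = 0.95)

# One-sided test of H0: beta_1 >= 0 against Ha: beta_1 < 0
t_stat <- summary(fit)$coefficients["x", "t value"]
p_lower <- pt(t_stat, df = df.residual(fit))  # lower-tail p-value
p_lower
```

If the confidence interval for the slope excludes 0, the slope is statistically significant at the matching level; if `p_lower` is below the chosen $\alpha$, the slope is significantly negative at that level.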