Data science is one of the fastest growing business functions, which is reflected in salaries for data science jobs. Because the work is done almost entirely using computers, such jobs can usually be performed remotely. The Covid pandemic has made remote work more common. To determine how remote work affects salaries, a dataset with a sample of 169 data science jobs can be found below. The dataset includes the following variables: itle

MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
icon
Related questions
Question

 

 

For parts: a,b,c,d,e

Job Title

Salary

Large Company

US Employer

Remote

Data Science Consultant

64369

1

0

1

Head of Data Science

85000

0

0

0

Head of Data

230000

1

0

1

Machine Learning Engineer

125000

0

1

1

Data Analytics Manager

120000

0

1

1

Data Science Engineer

127543

1

0

1

Manager Data Science

144000

1

1

1

Data Scientist

13400

1

0

1

Data Scientist

75966

1

0

1

Data Scientist

150000

0

1

1

Data Engineering Manager

153000

1

1

1

Data Engineer

90000

1

1

1

Data Analyst

90000

0

1

1

Data Analyst

60000

0

1

1

Data Scientist

50000

1

0

1

Applied Data Scientist

54376

1

0

1

Machine Learning Engineer

47681

0

0

1

Director of Data Science

154963

1

0

1

Data Engineer

28801

1

0

1

Data Analytics Engineer

110000

1

1

1

Research Scientist

83000

1

0

1

Data Analyst

59601

0

0

1

Data Analyst

80000

0

0

1

Data Engineer

140000

1

1

1

Data Analytics Engineer

79866

1

0

1

Lead Data Analyst

170000

1

1

1

Data Analyst

80000

0

1

1

BI Data Analyst

100000

0

1

1

Data Scientist

53641

1

0

1

Head of Data

235000

1

1

1

BI Data Analyst

150000

1

0

1

Machine Learning Scientist

225000

1

1

1

Data Science Consultant

77481

0

0

1

Marketing Data Analyst

89402

1

0

1

Lead Data Engineer

103750

0

0

1

Director of Data Engineering

114125

0

0

1

Machine Learning Engineer

95362

1

0

1

Data Engineer

30509

1

0

1

Data Engineer

150000

0

1

1

Data Engineer

115000

0

1

1

Research Scientist

187917

1

0

1

Data Analyst

51814

1

0

1

BI Data Analyst

36732

1

0

1

Data Engineer

150000

1

1

1

Computer Vision Software Engineer

96554

0

0

1

Computer Vision Software Engineer

70000

0

1

1

Financial Data Analyst

450000

1

1

1

Cloud Data Engineer

89514

1

0

1

Data Scientist

29831

1

0

1

Lead Data Engineer

276000

1

1

0

Cloud Data Engineer

160000

0

0

1

Data Engineer

200000

1

1

1

Data Engineering Manager

174000

1

1

1

Data Analyst

93000

1

1

1

Data Scientist

28475

0

0

1

Research Scientist

61270

1

0

1

Data Scientist

90000

0

1

1

Principal Data Analyst

170000

0

1

1

Data Engineer

96833

1

0

1

Data Engineer

13105

0

0

0

Data Scientist

36952

1

0

1

Data Engineer

72625

1

0

1

Big Data Architect

99956

0

0

1

Data Scientist

165000

1

1

1

Data Analyst

80000

1

1

1

Data Scientist

103954

1

0

1

Data Engineer

21695

0

0

1

Research Scientist

63971

0

0

1

Head of Data Science

110000

0

1

0

Data Architect

180000

1

1

1

Data Analyst

200000

1

1

1

Director of Data Engineering

200000

1

1

1

ML Engineer

256000

0

1

1

Data Engineer

110000

1

1

1

Data Engineer

72500

1

1

1

Machine Learning Engineer

185000

1

1

1

Research Scientist

100000

1

0

0

Data Engineer

112000

1

1

1

Data Scientist

21843

1

0

1

AI Scientist

55000

1

0

1

Data Scientist

58000

1

1

1

Data Scientist

100000

0

1

1

Data Scientist

78340

0

0

1

Machine Learning Engineer

85000

0

0

1

Data Science Consultant

77481

1

0

0

Data Engineer

65561

0

0

1

Data Engineer

30337

0

0

1

Data Engineer

111775

0

1

0

Data Engineer

93150

0

1

0

Lead Data Engineer

160000

0

0

1

Data Scientist

25747

0

0

1

Machine Learning Engineer

66442

1

0

0

Data Scientist

16949

0

0

1

Data Analyst

64369

1

0

1

Director of Data Science

143043

1

0

0

Big Data Engineer

16271

1

0

1

Data Analyst

71968

0

0

1

Data Scientist

135000

1

1

0

Machine Learning Engineer

25032

0

0

1

Data Science Manager

54238

1

0

1

Machine Learning Engineer

24407

1

0

1

BI Data Analyst

9272

0

0

1

Data Scientist

147000

1

1

1

Research Scientist

96357

1

0

1

Data Science Manager

174000

1

1

1

Machine Learning Engineer

21844

0

0

1

Data Science Consultant

70329

0

0

1

Data Analytics Engineer

50000

0

0

1

Data Engineer

4000

0

0

1

Data Engineer

26224

1

0

0

Data Scientist

91500

1

0

1

Big Data Engineer

22671

1

0

0

Data Scientist

5695

0

0

1

Machine Learning Engineer

81000

0

1

1

Data Scientist

40798

1

0

1

Data Scientist

2876

0

0

0

Data Science Consultant

90000

0

1

1

Data Scientist

61985

0

0

1

Machine Learning Infrastructure Engineer

195000

0

1

1

Data Scientist

38144

1

0

1

Machine Learning Scientist

225000

1

1

1

Data Scientist

56578

1

0

1

Data Scientist

33899

0

0

0

Data Scientist

117583

1

0

1

Machine Learning Engineer

47129

1

0

1

Machine Learning Engineer

89402

0

0

1

Data Scientist

89402

1

0

1

Data Engineer

66400

0

0

1

Research Scientist

57217

0

0

1

Machine Learning Engineer

25032

1

0

1

Data Analytics Manager

120000

1

1

0

Machine Learning Engineer

200000

1

1

1

Data Scientist

160000

1

1

1

Research Scientist

50000

0

0

1

Data Science Engineer

40529

0

0

1

Principal Data Engineer

600000

1

1

1

Data Scientist

13000

0

0

0

Data Engineer

165000

0

1

0

Big Data Engineer

5898

1

0

0

Principal Data Engineer

185000

1

1

1

Data Scientist

91500

1

0

1

Data Analytics Manager

140000

1

1

1

Data Scientist

87961

0

0

1

Finance Data Analyst

62250

1

0

1

Data Engineer

77481

0

0

1

Machine Learning Engineer

74000

0

0

1

Data Science Manager

152000

1

1

1

Big Data Engineer

18000

0

0

0

Data Scientist

130000

1

1

1

Computer Vision Engineer

19052

0

0

0

Business Data Analyst

59601

1

0

1

Principal Data Scientist

175228

0

0

1

Data Scientist

47204

0

0

1

Data Scientist

4000

0

0

0

AI Scientist

18102

0

0

1

Data Scientist

115000

1

1

1

Principal Data Scientist

235000

1

1

1

Lead Data Analyst

19661

1

0

1

Data Analyst

75000

1

1

0

Data Analyst

62000

1

1

0

Data Scientist

73000

1

1

0

Data Engineer

45773

1

0

1

Director of Data Science

168000

0

0

0

Data Scientist

119353

0

0

1

Applied Machine Learning Scientist

423000

1

1

1

Data Engineer

28608

1

0

1

Data Specialist

165000

1

1

1

Principal Data Scientist

151000

1

1

1

Data Science Manager

94917

1

0

1

1. Data science is one of the fastest growing business functions, which is reflected in salaries for data science jobs. Because the
work is done almost entirely using computers, such jobs can usually be performed remotely. The Covid pandemic has made
remote work more common. To determine how remote work affects salaries, a dataset with a sample of 169 data science jobs
can be found below. The dataset includes the following variables:
Job Title
Salary (US $) - annual salary in US dollars
Large Company – dummy variable indicating whether the company has more than 250 employees (1=yes, 0=no)
US Company – dummy variable indicating whether the company is US-based (1=yes, 0=no) Remote – dummy variable
indicating whether the job is remote (1=yes, 0=no)
a) Using the below", estimate a simple linear regression to predict Salary using only Remote as an independent variable. How
well does the model fit the dependent variable? Report the relevant value and briefly explain.
b) What is the estimated slope coefficient for the model estimated in part (a)? Interpret that slope coefficient. Is there a linear
relationship between the independent and dependent variables at a= .05? Report the relevant test statistic and p-value for
that test statistic.
c) Now estimate a multiple linear regression to predict Salary using all available independent variables: Large Company, US
Company, and Remote (do not include Job Title). Write the estimated regression equation.
d) Is the model estimated in part (c) statistically significant at the a = .05 level? Write the null and alternative hypotheses for this
test, the appropriate test statistic, the p-value for that test statistic, as well as your conclusion.
e) Which independent variables (if any) in the model estimated in part (c) are related to the dependent variable Salary at the a =
.05 level, given the other independent variables in the model? Include relevant values from the data output to support your
conclusions.
Transcribed Image Text:1. Data science is one of the fastest growing business functions, which is reflected in salaries for data science jobs. Because the work is done almost entirely using computers, such jobs can usually be performed remotely. The Covid pandemic has made remote work more common. To determine how remote work affects salaries, a dataset with a sample of 169 data science jobs can be found below. The dataset includes the following variables: Job Title Salary (US $) - annual salary in US dollars Large Company – dummy variable indicating whether the company has more than 250 employees (1=yes, 0=no) US Company – dummy variable indicating whether the company is US-based (1=yes, 0=no) Remote – dummy variable indicating whether the job is remote (1=yes, 0=no) a) Using the below", estimate a simple linear regression to predict Salary using only Remote as an independent variable. How well does the model fit the dependent variable? Report the relevant value and briefly explain. b) What is the estimated slope coefficient for the model estimated in part (a)? Interpret that slope coefficient. Is there a linear relationship between the independent and dependent variables at a= .05? Report the relevant test statistic and p-value for that test statistic. c) Now estimate a multiple linear regression to predict Salary using all available independent variables: Large Company, US Company, and Remote (do not include Job Title). Write the estimated regression equation. d) Is the model estimated in part (c) statistically significant at the a = .05 level? Write the null and alternative hypotheses for this test, the appropriate test statistic, the p-value for that test statistic, as well as your conclusion. e) Which independent variables (if any) in the model estimated in part (c) are related to the dependent variable Salary at the a = .05 level, given the other independent variables in the model? Include relevant values from the data output to support your conclusions.
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 5 steps with 1 images

Blurred answer
Similar questions
Recommended textbooks for you
MATLAB: An Introduction with Applications
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman