Lab 2-Graphing Data and Numerical Summaries_Student
docx
keyboard_arrow_up
School
Purdue University *
*We aren’t endorsed by this school
Course
301
Subject
Statistics
Date
Feb 20, 2024
Type
docx
Pages
4
Uploaded by AdmiralScorpionPerson691
Name: T.A. name:
Lab 2: Graphing Data and Numerical Summaries
NOTE: SPSS outputs are necessary to show full completion of the lab. Please paste all SPSS outputs into your lab report and submit the completed reports including all requested tables and graphs via Brightspace (under the "Lab" folder) by 11:50 pm Friday.
Two points will be deducted for each SPSS requested output that is not included in the submitted lab document.
Also, 30% points will be deducted for late submission, up to 24 hours.
Dataset
: This lab uses the dataset (
SleepPatterns
), located on Brightspace under Lab in the Datasets submodule. Instructions for opening the dataset in SPSS are found as follows.
SPSS installed on a computer: Reference page 4 of the SPSS Instruction Manual
SPSS running remotely: Reference the slide “Opening your Dataset Remotely in SPSS via Go Remote” in the document “SPSS using Citrix access guidelines” on Brightspace.
Two hundred fifty college students in Indiana participated in a study examining the
associations among sleep habits, sleep quality and physical/emotional factors.
Participants completed an online survey about sleep habits that included the Pittsburgh
Sleep Quality Index (PSQI), the Epworth Sleepiness Scale (ESS), the Horne-Ostberg
Morningness Eveningness Scale (MES), the Subjective Units of Distress Scale (SUDS),
and questions about academic performance and physical health. 1.
(2 points) Fill in the chart below. In the second column, record whether the variable is
quantitative
or categorical, based on the data in the lab dataset
. In the third column
record whether a bar graph
or a histogram
correctly shows the distribution of the
data.
Variable Name
Type of variable
Type of graph
Sleep_time_week
Quantitative
Histogram
Gender
Categorical Bar Graph
Weight
Quantitative
Histogram
Class
Categorical
Bar Graph
Age
Quantitative
Histogram
1
2.
(2 points) Use SPSS to make a bar graph
for the variable Gender
. Copy and paste
the graph into this document here
. 3.
(2 points) Use SPSS to make a histogram
for the variable Sleep_time_week
. Copy
and paste the graph into this document here
.
4.
(2 points) Use SPSS to find the mean
and standard deviation
of Sleep_time_week
.
Record the values rounded to
two decimal places below. Copy and paste
t
he SPSS
output into this document here.
Sleep_time_week: Mean: 7.05 Standard deviation: 0.46
2
Descriptive Statistics
N
Minimum
Maximum
Mean
Std. Deviation
Sleep_time_wee
k
250
5.74
8.27
7.0448
.45649
Valid N (listwise)
250
5.
(4 points) SPSS is not used in this course to find quartiles because the program’s
default method yields an unbiased estimate of the population quartiles. Rather, in this
course, the interest is to find quartiles associated with the sample. BY HAND
, find the 5-number summary
of
Sleep_time_week
. Show your work on
how to find the location of Q1, Median and Q3. (You can use SPSS to sort the data
from smallest to largest first, and use the row number on the left of the Data View tab
to make the job easier).
Q
1
: (6.74+6.74)/2 = 6.74
Median: (7.05+7.07)/2 = 7.06
Q
3
: (7.33+7.34)/2 = 7.34
5# summary for Sleep_time_week
: Min: 5.74
Q
1
: 6.74 Median: 7.06 Q
3
: 7.34 Max: 8.27
6.
(2 points) Inspect the histogram for the variable Sleep_time_week
created in
Question 3. Is the graph approximately symmetric, skewed left, or skewed right
?
Explain how the graph’s shape is related to the Mean and Median calculated in
Questions 4 and 5.
The graph is approximately symmetric. Because the mean and median are so close, they
make the graph approximately symmetric.
7.
(2 points) BY HAND
, use the 1.5*IQR Rule to determine if there are any suspected
outliers for the variable Sleep_time_week
. Show your work. State and explain why
there are or not suspected outliers. If there are suspected outliers, identify the values.
IQR: 7.34 – 6.74 = 0.6
Lower: 6.74 – (1.5*0.6) = 5.84
Upper: 7.34 + (1.5*0.6) = 8.24
Outliers: 5.74, 5.79, 8.27
8.
(2 points) Use SPSS to make a boxplot
or modified boxplot
for the variable
Sleep_time_week.
Copy and paste the graph into this document here
. In the space
below, explain where the numbers calculated in Question 5 appear.
3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
The min is the bottom line, Q
1
is the next line above it, the median is the bolded line, above that is Q
3
, and the top line is the max.
9.
(2 points) For the variable Sleep_time_week
, explain if the mean and standard
deviation
are appropriate to describe the distribution. If not, what should be used
instead?
It is appropriated to use mean and standard deviation for this distribution because the
mean is central to the data and the data is approximately symmetric.
4
Related Documents
Related Questions
Create a side-by-side boxplot for vitamin D level vs. NewAge and a side-
by-side boxplot for vitamin D level vs. country.
Create a scatterplot to show the relationship between vitamin D level
and Age.
Compare these two side-by-side boxplots and the scatterplot and explain
your findings.
• Note: Write appropriate captions for the tables, graphs, and outputs.
arrow_forward
Now monitor the process. An additional ten days of data have been collected, see table labeled “1st 10 Days of Monitoring Reservation Processing Time” in the Data File.
Develop Xbar and R charts for the 1st 10 days of monitoring. Plot the data for the 1st 10 days on the Xbar and R charts.
Is the process in control? If the control chart indicates an out-of-control process, note which days, the pattern, and whether it is the Xbar or R chart.
Based on the X-bar and R Charts that you developed for the 1st 10 days of data, is the process in control?
Group of answer choices
No. The X-bar and R Charts are both out of control.
No. The X-bar Chart is in control, but the R Chart is out of control.
No. The R Chart is in control, but the X-bar Chart is out of control.
Yes. The X-bar and R Charts are both in control.
arrow_forward
Install RStudio: Begin by installing RStudio on your computer. If you haven't done so, please refer to the official RStudio website for download and installation instructions.
Watch the Tutorial Video: Watch the provided video tutorial that explains how to run RStudio. Pay close attention to the steps for opening and managing data files. https://www.youtube.com/watch?v=RhJp6vSZ7z0
Open RStudio: Once RStudio is installed, open the application.
Load the Dataset: In RStudio, open a data file named "mtcars". To do this, type the command mtcars in the script editor and run the command.
Attach the Data: Next, attach the dataset using the command attach(mtcars).
Examine the Variables: Carefully review and note the names of all variables in the dataset. Examples of these variables include:
Mileage (mpg)
Number of Cylinders (cyl)
Displacement (disp)
Horsepower (hp)
Research: Google to understand these variables.
Statistical Analysis: Select mpg variable, and perform the following…
arrow_forward
A group of people at the park were asked their ages, and the results can be downloaded from the data file Ages.
In StatKey, which menu option would you select under "Descriptive Statistics and Graphs" to graph the data?
One Quantitative
One Categorical
One Quantitative and One Categorical
A group of people at the park were asked their ages, and the results can be downloaded from the data file Ages.
Summarize this data by creating a histogram with StatKey, and submit your graph as a PDF. When creating your graph, please make sure the number of buckets is set to 10.
arrow_forward
tion 2 of 15
Last summer, the Smith family drove through seven different states and visited various popular landmarks. The prices of gasoline
in dollars per gallon varied from state to state and are listed below.
$2.34, $2.75, $2.48, $3.58, $2.87, $2.53, $3.31
Click to download the data in your preferred format.
CrunchIt! CSV Excel JMP Mac Text Minitab PC Text R SPSS TI Calc
Calculate the range of the price of gas. Give your solution to the nearest cent.
range:
dollars per gallon
DELL
&
4.
7
8.
arrow_forward
Johnson Filtration, Inc. provides maintenance service for water-filtration systems. Suppose that in addition to information on the
number of months since the machine was serviced and whether a mechanical or an electrical repair was necessary, the managers
obtained a list showing which repairperson performed the service. The revised data follow.
Click on the datafile logo to reference the data.
DATA file
Repair Time
Months Since
in Hours
Last Service
Type of Repair
Repairperson
2.9
Electrical
Dave Newton
3.0
Mechanical
Dave Newton
4.8
8.
Electrical
Bob Jones
1.8
Mechanical
Dave Newton
2.9
Electrical
Dave Newton
4.9
Electrical
Bob Jones
4.2
6.
Mechanical
Bob Jones
4.8
8.
Mechanical
Bob Jones
4.4
4.
Electrical
Bob Jones
4.5
Electrical
Dave Newton
a. Ignore for now the months since the last maintenance service (1 ) and the repairperson who performed the service. Develop the
estimated simpe linear regression equation to predict the repair time (y) given the type of repair (2 ). Recall that…
arrow_forward
Please answer number 3 from a-c. Neglect the number 2 questions. Thanks.
arrow_forward
On a cold day in Minneapolis, the afternoon temperature was 48 degrees before a cold front moved through. As
the front moved through the temperature dropped an average of 5 degrees per hour for a total of 5 hours.
14
2/1
目
Identify the domain of the data set.
arrow_forward
On December 17, 2007 baseball writer John Hickey wrote an article for the Seattle P-I about increases to ticket prices for Seattle Mariners
games during the 2008 season. The article included a data set that listed the average ticket price for each MLB team, the league in which the team
plays (AL or NL), the number of wins during the 2007 season and the cost per win (in dollars). The data for the 16 National League teams are shown
below.
league
price
wins
cost/win
team
Arizona Diamondbacks
NL
19.68
90
35.40
Atlanta Braves
NL
17.07
84
32.89
Chicago Cubs
NL
34.30
85
65.33
cincinnati Reds
NL
17.90
72
40.32
Colorado Rockies
NL
14.72
90
26.67
Florida Marlins
NL
16.70
71
38.13
Houston Astros
NL
26.66
73
59.11
Los Angeles Dodgers
20.09
82
34.64
NL
Milwaukee Brewers
NL
18.11
83
35.37
N.Y. Mets
NL
25.28
88
46.56
Philadelphia Phillies
26.73
89
48.69
NL
Pittsburgh Pirates
NL
17.08
68
40.67
San Diego Padres
NL
20.83
89
38.15
San Francisco Giants
NL
24.53
71
56.00
St. Louis Cardinals
NL
29.78
78…
arrow_forward
The spreadsheet at MarathonWinningTimes.xlsx shows the history of the winning
time in the Boston Marathon for men and women from 1966 (when women first ran) through
2013.
What is the average rate at which the men’s finishing time changed from year to year?
arrow_forward
Briefly describe the methods of collecting primary data
arrow_forward
please assist with this NON GRADED assignment
arrow_forward
A lecturer at WIN wanted to know if he can predict student’s quiz results by asking them to complete a simple survey. The result of the survey is found in the file: Assignment 2 sem22020 data set 1.Quiz ResultActual Mark (0-15) for quiz student attainedEQRQuiz score (0-15) expected to get before taking the quizStudy Hrs.Number of hours per week (on average) spent studying for StatisticsAgeAge (in years)BBTSatisfaction rating of Big Bang TheorySexM=1 F=0MBMB=1 for good math background, otherwise 0MCMC= 1 if math centre is used regularly, otherwise 0AuHSAuHS = 1 if student completed high school in Australia, otherwise 0LMLM=1 if student likes math, 0 otherwiseTask 1: Variable List(a) Using the variables listed in the table above, Describe each variable.(b) State for each variable whether it is qualitative or quantitative; if it is qualitative, state whether it is nominal or ordinal, and if it is quantitative, state whether it is discrete or continuous.Task 2: HistogramCreate a histogram…
arrow_forward
Four different paints are advertised as having the same drying time. To check the manufacturer's claims, five samples were tested for each of the paints. The time in minutes until the paint was dry enough for a second coat to be applied was recorded. The following data were obtained.
Excel users: The data set is available in file named Paint. All data sets can be found on the premium online datasite. Click on the datafile logo to reference the data. (the excel data is below)
Paint 1
Paint 2
Paint 3
Paint 4
128
144
133
150
137
133
143
142
135
142
137
135
124
146
136
140
141
130
131
153
At the A=0.05 level of significance, test to see whether the mean drying time is the same for each type of paint.
Compute the values identified below (to 2 decimals, if necessary).
Sum of Squares, Treatment
Sum of Squares,…
arrow_forward
An insurance company hires an actuary to determine whether the number of hours of safety drivingclasses can be used to predict the number of driving accidents for each driver. Identify theexplanatory variable, if any.
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you
data:image/s3,"s3://crabby-images/b9e14/b9e141b888912793d57db61a53fa701d5defdb09" alt="Text book image"
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
data:image/s3,"s3://crabby-images/9ae58/9ae58d45ce2e430fbdbd90576f52102eefa7841e" alt="Text book image"
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Related Questions
- Create a side-by-side boxplot for vitamin D level vs. NewAge and a side- by-side boxplot for vitamin D level vs. country. Create a scatterplot to show the relationship between vitamin D level and Age. Compare these two side-by-side boxplots and the scatterplot and explain your findings. • Note: Write appropriate captions for the tables, graphs, and outputs.arrow_forwardNow monitor the process. An additional ten days of data have been collected, see table labeled “1st 10 Days of Monitoring Reservation Processing Time” in the Data File. Develop Xbar and R charts for the 1st 10 days of monitoring. Plot the data for the 1st 10 days on the Xbar and R charts. Is the process in control? If the control chart indicates an out-of-control process, note which days, the pattern, and whether it is the Xbar or R chart. Based on the X-bar and R Charts that you developed for the 1st 10 days of data, is the process in control? Group of answer choices No. The X-bar and R Charts are both out of control. No. The X-bar Chart is in control, but the R Chart is out of control. No. The R Chart is in control, but the X-bar Chart is out of control. Yes. The X-bar and R Charts are both in control.arrow_forwardInstall RStudio: Begin by installing RStudio on your computer. If you haven't done so, please refer to the official RStudio website for download and installation instructions. Watch the Tutorial Video: Watch the provided video tutorial that explains how to run RStudio. Pay close attention to the steps for opening and managing data files. https://www.youtube.com/watch?v=RhJp6vSZ7z0 Open RStudio: Once RStudio is installed, open the application. Load the Dataset: In RStudio, open a data file named "mtcars". To do this, type the command mtcars in the script editor and run the command. Attach the Data: Next, attach the dataset using the command attach(mtcars). Examine the Variables: Carefully review and note the names of all variables in the dataset. Examples of these variables include: Mileage (mpg) Number of Cylinders (cyl) Displacement (disp) Horsepower (hp) Research: Google to understand these variables. Statistical Analysis: Select mpg variable, and perform the following…arrow_forward
- A group of people at the park were asked their ages, and the results can be downloaded from the data file Ages. In StatKey, which menu option would you select under "Descriptive Statistics and Graphs" to graph the data? One Quantitative One Categorical One Quantitative and One Categorical A group of people at the park were asked their ages, and the results can be downloaded from the data file Ages. Summarize this data by creating a histogram with StatKey, and submit your graph as a PDF. When creating your graph, please make sure the number of buckets is set to 10.arrow_forwardtion 2 of 15 Last summer, the Smith family drove through seven different states and visited various popular landmarks. The prices of gasoline in dollars per gallon varied from state to state and are listed below. $2.34, $2.75, $2.48, $3.58, $2.87, $2.53, $3.31 Click to download the data in your preferred format. CrunchIt! CSV Excel JMP Mac Text Minitab PC Text R SPSS TI Calc Calculate the range of the price of gas. Give your solution to the nearest cent. range: dollars per gallon DELL & 4. 7 8.arrow_forwardJohnson Filtration, Inc. provides maintenance service for water-filtration systems. Suppose that in addition to information on the number of months since the machine was serviced and whether a mechanical or an electrical repair was necessary, the managers obtained a list showing which repairperson performed the service. The revised data follow. Click on the datafile logo to reference the data. DATA file Repair Time Months Since in Hours Last Service Type of Repair Repairperson 2.9 Electrical Dave Newton 3.0 Mechanical Dave Newton 4.8 8. Electrical Bob Jones 1.8 Mechanical Dave Newton 2.9 Electrical Dave Newton 4.9 Electrical Bob Jones 4.2 6. Mechanical Bob Jones 4.8 8. Mechanical Bob Jones 4.4 4. Electrical Bob Jones 4.5 Electrical Dave Newton a. Ignore for now the months since the last maintenance service (1 ) and the repairperson who performed the service. Develop the estimated simpe linear regression equation to predict the repair time (y) given the type of repair (2 ). Recall that…arrow_forward
- Please answer number 3 from a-c. Neglect the number 2 questions. Thanks.arrow_forwardOn a cold day in Minneapolis, the afternoon temperature was 48 degrees before a cold front moved through. As the front moved through the temperature dropped an average of 5 degrees per hour for a total of 5 hours. 14 2/1 目 Identify the domain of the data set.arrow_forwardOn December 17, 2007 baseball writer John Hickey wrote an article for the Seattle P-I about increases to ticket prices for Seattle Mariners games during the 2008 season. The article included a data set that listed the average ticket price for each MLB team, the league in which the team plays (AL or NL), the number of wins during the 2007 season and the cost per win (in dollars). The data for the 16 National League teams are shown below. league price wins cost/win team Arizona Diamondbacks NL 19.68 90 35.40 Atlanta Braves NL 17.07 84 32.89 Chicago Cubs NL 34.30 85 65.33 cincinnati Reds NL 17.90 72 40.32 Colorado Rockies NL 14.72 90 26.67 Florida Marlins NL 16.70 71 38.13 Houston Astros NL 26.66 73 59.11 Los Angeles Dodgers 20.09 82 34.64 NL Milwaukee Brewers NL 18.11 83 35.37 N.Y. Mets NL 25.28 88 46.56 Philadelphia Phillies 26.73 89 48.69 NL Pittsburgh Pirates NL 17.08 68 40.67 San Diego Padres NL 20.83 89 38.15 San Francisco Giants NL 24.53 71 56.00 St. Louis Cardinals NL 29.78 78…arrow_forward
- The spreadsheet at MarathonWinningTimes.xlsx shows the history of the winning time in the Boston Marathon for men and women from 1966 (when women first ran) through 2013. What is the average rate at which the men’s finishing time changed from year to year?arrow_forwardBriefly describe the methods of collecting primary dataarrow_forwardplease assist with this NON GRADED assignmentarrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL
data:image/s3,"s3://crabby-images/b9e14/b9e141b888912793d57db61a53fa701d5defdb09" alt="Text book image"
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
data:image/s3,"s3://crabby-images/9ae58/9ae58d45ce2e430fbdbd90576f52102eefa7841e" alt="Text book image"
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL