STAT 151 STUDY DOCUMENT PRACTISE LAB FINAL QUIZ 1 NUSRAT.docx
pdf
keyboard_arrow_up
School
University of Alberta *
*We aren’t endorsed by this school
Course
151
Subject
Statistics
Date
Feb 20, 2024
Type
Pages
12
Uploaded by ChancellorFang13225
LAB QUIZ 1 REVIEW
Question
1
C1 Q1 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains several
columns. Identify the type of variable in the column FEDBEST, a variable that
measures which Federal political party students think had the best platform
(Conservative, Green, Liberal, NDP).
Answer
qualitative categorical (non-ordinal)
qualitative categorical (ordinal)
quantitative numerical discrete
quantitative numerical continuous
Question
2
C1 Q2 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains several
columns. Identify the type of variable in the column FEDVOTE, a variable that
measures which Federal political party students will vote for (Conservative, Green,
Liberal, NDP).
Answer
qualitative categorical (non-ordinal)
qualitative categorical(ordinal)
quantitative numerical discrete
quantitative numerical continuous
Question3
C2 Q1 V1 The survey data STATISTICSSTUDENTSSURVEYFORR
contains several columns. Use R to calculate the counts of the
outcomes in the ALBBEST column. Which of the following
statements is most correct?
The mode of the outcomes of the variable ALBBEST is NDP.
The mode of the outcomes of the variable ALBBEST is 29.
The mode of the outcomes of the variable ALBEST is 48.33%.
The mode of the outcomes of the variable ALBBEST cannot be determined.
Command:
TALLY COUNTS AND PERCENTS
Statistics > Summaries > Frequency Distributions
Variables (pick one or more):
ALBBEST
Apply, OK
Output
1
LAB QUIZ 1 REVIEW
The highest count occurs for the outcome of NDP
Question4
Answer
C2 Q5 V1 The survey data STATISTICSSTUDENTSSURVEYFORR
contains several columns. Use R to calculate the percentages of
the outcomes in the ALBBEST column. Which of the following
statements below is FALSE?
An observation in the ALBBEST data can take on one of 4 possible outcomes (choices
percent of data that is NDP is 48.33%, and the percent of data that is not NDP is 51.6
The number of observations in the ALBBEST column is 60, the percent of data that is
48.33%, and the percent of data that is not NDP is 51.67%
An observation in the ALBBEST data can take on one of 60 possible outcomes (choice
count of NDP in the ALBBEST column is 29, and the percent of data that is NDP is 48.
The count of NDP in the ALBBEST column is 29, the percent of data that is NDP is 48.
and the percent of data that is not NDP is 51.67%
Command:
TALLY COUNTS AND PERCENTS
1b) Statistics > Summaries > Frequency
Distributions Variables (pick one or more):
ALBBEST
Apply, OK
Total count = 5+5+29+21 = 60
Percent NDP = 48.33%
Percent not NDP = 100% -48.33% = 51.67%
Question
5
C2 Q3 V1 The survey data STATISTICSSTUDENTSSURVEYFORR contains the column
FAAGESTUDBIRTH which is a variable that measures the age of students' fathers
when students were born. Use R to make a histogram to describe the data. Choose
the most correct answer to describe what you see in the shape of the data
distribution.
Answer
symmetric
left skewed
right skewed
uniform
2
LAB QUIZ 1 REVIEW
Command : 6)
Graphs >
Histogram Variable (pick one):
FAAGESTUDBIRTH
Options:
Frequency counts
x-axis label:
Agey-axis label:
FREQUENCY
Graph Title:
…………
..
Apply, OK
Output
Question6
Answer
C2 Q4 V1 The survey data STATISTICSSTUDENTSSURVEYFORR
contains the column FAVSPORTWATCH which is a variable that
measures the favourite sports that students like to watch
(baseball, football, hockey, tennis). Use R to make an appropriate
chart for this column. Indicate which of the following statements
in most correct.
The mode of FAVSPORTWATCH is tennis, and the mean of FAVSPORTWATCH is 15.
The mode of FAVSPORTWATCH is tennis, and the mean of FAVSPORTWATCH canno
3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
LAB QUIZ 1 REVIEW
calculated.
The mode of FAVSPORTWATCH is 21, and the mean of FAVSPORTWATCH is 15.
The mode of FAVSPORTWATCH is 21, and the mean of FAVSPORTWATCH cannot be
calculated.
Command:
TALLY COUNTS AND PERCENTS
Statistics > Summaries > Frequency Distributions
Variables (pick one or more):
FAVSPORTWATCH
Apply, OK
Output
The most common occurring sport is tennis, with a 21 count. Since the data is categorical, the mean (a
number) cannot be calculated.
Question7
Answer
C3 Q1 V1: The survey data STATISTICSSTUDENTSSURVEYFORR
contains the column FAAGESTUDBIRTH (a variable that
measures father's age at student birth in years) and the
column BAORBS (a variable that measures whether the
student is pursuing a BA or a BS). Indicate which of the
following statements is most correct.
The BA group has 2 outliers and the BS group has 3 outliers.
The outlier FAAGESTUDBIRTH data values in the BA group are 29 years and 37 years.
The outlier FAAGESTUDBIRTH data values in the BS group are 32 years, 27 years, and 2
years.
The range for the BA group is wider than the range for the BS group.
SIDE BY SIDE BOXPLOTS
Command: Graphs > Boxplot Variable
(pick one):
FAAGESTUDBIRTH
Plot by
Groups:
BAORBS
OK
Options:
x-axis label: Degree
4
LAB QUIZ 1 REVIEW
y-axis label:
Age
Graph Title:
………………
.
Apply, OK.
Output (Outliers can be counted to be 3 for BS, and 2 for BA)
5
LAB QUIZ 1 REVIEW
Question8
C3 Q8 V1 The survey data STATISTICSSTUDENTSSURVEYFORR
contains the column MOREAD which is a variable that measures
the age (in months) at which students first began to read on their
own. Use R to determine the 5 number summary of this column
data. Which of the following statements is incorrect?
Answer
:
75% of this data is above 50.75
25% of this data is below 50.75
25% of this data is above 60.00
75% of this data is above 60.00
Command: Statistics > Summaries > Numerical Summaries
Variables (pick one or more):
MOREAD
Statistics:
Leave default checks in Mean, Standard Deviation, Interquartile Range, and Quantiles
APPLY, OK
Output
Q1 = 50.75, so 25% of the data lies below Q1, so 75% of the data lies above 50.75
Q3 = 60.00, so 75% of the data lies below Q3, and 25% of the data lies above 60
Question9
C3 Q7 V1 The survey data STATISTICSSTUDENTSSURVEYFORR contains the column
MOREAD which is a variable that measures the age (in months) at which students
first began to read on their own. Use R to find the mean, standard deviation, 5
number summary, and the number of observations for this data. Which of the
following statements is incorrect?
Answer
The range of this data is 31 months.
The mean for this data is 55.03333
months and the standard deviation for
this data is 7.147280 months.
The number of observations for this data
is 60.
6
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
LAB QUIZ 1 REVIEW
25% of the data in this data set lies
between 50.75 months and 60.00
months.
Command: Statistics > Summaries > Numerical Summaries
Variables (pick one or more):
MOREAD
Statistics:
Leave default checks in Mean, Standard Deviation, Interquartile Range, and Quantiles
APPLY, OK
Output
Q1 = 50.75 and Q2 (Median) = 55, so 25% of the data lies between 50.75 and 55. The last statement is
incorrect. With Q3 = 60, 50% of the data actually lies between 50.75 and 60.
From the dotplot, we can see 3 peaks at 54, 55 and 60.
The mean and standard deviation are 50.03333 and 7.14728, respectively.
The range of the data is 67-36 = 31 months
7
LAB QUIZ 1 REVIEW
Question10
C3 Q4 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column
ENDPULSEMIN (a variable that measures student pulse in beats per min (bpm) after
doing the survey) and the column UNDERGORGRAD (a variable that measures
whether the student is pursuing an undergraduate or graduate degree). Create side
by side boxplots to describe this data. Choose the most correct answer below.
Answer
The interquartile range for the graduate/professional data is smaller than the interquart
range for the undergraduate data, while the percent of data in the interquartile range o
the graduate/professional data is the same as the percent of data in the interquartile ra
of the undergraduate data.
The interquartile range for the graduate/professional data is smaller than the interquart
range for the undergraduate data, while the percent of data in the interquartile range o
the graduate/professional data is higher than the percent of data in the interquartile ran
of the undergraduate data.
The interquartile range for the graduate/professional data is smaller than the interquart
range for the undergraduate data, while the percent of data in the interquartile range o
the graduate/professional data is lower than percent of data in the interquartile range o
the undergraduate data.
The interquartile range for the graduate/professional data is larger than the interquartil
range for the undergraduate data, while the percent of data in the interquartile range o
the graduate/professional data is the same as the percent of data in the interquartile ra
of the undergraduate data.
SIDE BY SIDE BOXPLOTS
Command: Graphs > Boxplots
Variable (pick one):
ENDPULSEMIN
Plot by Groups:
UNDERGORGRAD
OK
Options:
x-axis label: Degree
y-axis label:
Pulse rate
Graph Title:
………………
.
Output REGARDLESS OF THE VALUE OF IQR, 50% OF THE DATA LIES BETWEEN Q1 AND Q3. WE CAN SEE
FROM THE BOXPLOTS THAT THE IQR IS SMALLER IN VALUE FOR THE GRADUATEPROFESSIONAL DATA
THAN FOR THE UNDERGRADUATE DATA.
8
LAB QUIZ 1 REVIEW
Question11
C3 Q5 V1:The survey data STATISTICSSTUDENTSSURVEYFORR contains the column
ENDPULSEMIN (a variable that measures student pulse in beats per min (bpm)
after doing the survey) and the column UNDERGORGRAD (a variable that
measures whether the student is pursuing an undergraduate or graduate degree).
Use R to find the mean and standard deviation of student pulse rate in beats per
minute for the undergrad students and the mean and standard deviation of
student pulse rate in beats per minute for the graduate students. Choose the most
correct answer below.
Answer
The undergraduate mean is more than the graduate mean and the
undergraduate standard deviation is less than the graduate standard deviation.
The undergraduate mean is more than the graduate mean and the
undergraduate standard deviation is more than the graduate standard
deviation.
The undergraduate mean is less than the graduate mean and the
undergraduate standard deviation is less than the graduate standard
deviation.
The undergraduate mean is less than the graduate mean and the undergraduate
standard deviation is more than the graduate standard deviation.
DESCRIPTIVE STATISTICS FOR A NUMERICAL VARIABLE FOR EACH OF 2 OUTCOMES OF A CATEGORICAL
VARIABLE
Command: Statistics > Summaries > Numerical Summaries
Variables (pick one or more):
ENDPULSEMIN
Summarize by Groups:
Groups variable (pick one): UNDERGORGRAD, OK
Statistics:
Leave default checks in Mean, Standard Deviation, Interquartile Range, and Quantiles
APPLY, OK
Output
9
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
LAB QUIZ 1 REVIEW
The undergraduate mean of 76.18182 is more (larger than) the graduate mean of 74.44444, and the
undergraduate standard deviation of 6.530958 is more (larger than) the graduate standard deviation of
5.451417, so the last offered answer is correct.
10
LAB QUIZ 1 REVIEW
Question12
C3 Q6 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column
BEFBREATHMIN which is a variable that measures the number of breaths taken by
students before completing a survey and the column ENDBREATHMIN which is a
variable that measures the number of breaths taken by students after completing
a survey. Use R to make an appropriate graph to describe the data. Indicate which
of the following statements is most correct.
Answer
The before-survey breaths per minute and the end-survey breaths per minute
increase together for this data
As the before-survey breaths per minute increase, the end-survey breaths per
minute decrease.
As the end-survey breaths per minute increase, the before-survey breaths per
minute decrease.
As the before-survey breaths per minute increase, the end-survey breaths per
minute do not change.
SCATTERPLOT OF TWO NUMERICAL VARIABLES
Command: Graphs > Scatterplot
x-variable (pick one):
BEFBREATHMIN
y-variable (pick one):
ENDBREATHMIN
Apply, OK
Output
11
LAB QUIZ 1 REVIEW
12
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Related Documents
Recommended textbooks for you

Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill

Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt

Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Recommended textbooks for you
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL

Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill

Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt

Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL