STAT 151 STUDY DOCUMENT PRACTISE LAB FINAL QUIZ 1 NUSRAT.docx

School

University of Alberta

Course

151

Subject

Statistics

Date

Feb 20, 2024

Type

pdf

Pages

12

Uploaded by ChancellorFang13225

LAB QUIZ 1 REVIEW

Question 1 C1 Q1 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains several columns. Identify the type of variable in the column FEDBEST, a variable that measures which Federal political party students think had the best platform (Conservative, Green, Liberal, NDP).

Answer
qualitative categorical (non-ordinal)
qualitative categorical (ordinal)
quantitative numerical discrete
quantitative numerical continuous

Question 2 C1 Q2 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains several columns. Identify the type of variable in the column FEDVOTE, a variable that measures which Federal political party students will vote for (Conservative, Green, Liberal, NDP).

Answer
qualitative categorical (non-ordinal)
qualitative categorical (ordinal)
quantitative numerical discrete
quantitative numerical continuous

Question 3 C2 Q1 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains several columns. Use R to calculate the counts of the outcomes in the ALBBEST column. Which of the following statements is most correct?

Answer
The mode of the outcomes of the variable ALBBEST is NDP.
The mode of the outcomes of the variable ALBBEST is 29.
The mode of the outcomes of the variable ALBBEST is 48.33%.
The mode of the outcomes of the variable ALBBEST cannot be determined.

Command: TALLY COUNTS AND PERCENTS
Statistics > Summaries > Frequency Distributions
Variables (pick one or more): ALBBEST
Apply, OK

Output
The highest count occurs for the outcome NDP, so the mode is NDP.

Question 4 C2 Q5 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains several columns. Use R to calculate the percentages of the outcomes in the ALBBEST column. Which of the following statements is FALSE?

Answer
An observation in the ALBBEST data can take on one of 4 possible outcomes (choices), the percent of data that is NDP is 48.33%, and the percent of data that is not NDP is 51.67%.
The number of observations in the ALBBEST column is 60, the percent of data that is NDP is 48.33%, and the percent of data that is not NDP is 51.67%.
An observation in the ALBBEST data can take on one of 60 possible outcomes (choices), the count of NDP in the ALBBEST column is 29, and the percent of data that is NDP is 48.33%.
The count of NDP in the ALBBEST column is 29, the percent of data that is NDP is 48.33%, and the percent of data that is not NDP is 51.67%.

Command: TALLY COUNTS AND PERCENTS
1b) Statistics > Summaries > Frequency Distributions
Variables (pick one or more): ALBBEST
Apply, OK

Total count = 5 + 5 + 29 + 21 = 60
Percent NDP = 29/60 = 48.33%
Percent not NDP = 100% - 48.33% = 51.67%
The column has 60 observations but only 4 possible outcomes, so the statement claiming 60 possible outcomes is the false one.

Question 5 C2 Q3 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column FAAGESTUDBIRTH, which is a variable that measures the age of students' fathers when students were born. Use R to make a histogram to describe the data. Choose the most correct answer to describe what you see in the shape of the data distribution.

Answer
symmetric
left skewed
right skewed
uniform
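As a cross-check on the Question 3 and Question 4 tally (mode, total count, percent NDP, percent not NDP), the arithmetic can be sketched in Python rather than R. This is only an illustration: the source gives the four counts but does not show which of the non-NDP outcomes has which count, so the labels `other_a`, `other_b` and `other_c` are placeholders.

```python
# Counts from the ALBBEST tally in the source: NDP = 29, and the other three
# outcomes are 5, 5 and 21. The source does not show which party has which
# of those counts, so the placeholder labels below are hypothetical.
counts = {"NDP": 29, "other_a": 21, "other_b": 5, "other_c": 5}

total = sum(counts.values())            # number of observations in the column
mode = max(counts, key=counts.get)      # outcome (a label) with the highest count
pct_ndp = round(100 * counts["NDP"] / total, 2)
pct_not_ndp = round(100 - pct_ndp, 2)

print(total, mode, pct_ndp, pct_not_ndp)
```

Note that the mode is the label of the most frequent outcome, not its count or its percentage, and that the number of possible outcomes is the number of labels (4), not the number of observations.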
Command: 6) Graphs > Histogram
Variable (pick one): FAAGESTUDBIRTH
Options: Frequency counts
x-axis label: Age
y-axis label: FREQUENCY
Graph Title: ………… ..
Apply, OK

Output

Question 6 C2 Q4 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column FAVSPORTWATCH, which is a variable that measures the favourite sports that students like to watch (baseball, football, hockey, tennis). Use R to make an appropriate chart for this column. Indicate which of the following statements is most correct.

Answer
The mode of FAVSPORTWATCH is tennis, and the mean of FAVSPORTWATCH is 15.
The mode of FAVSPORTWATCH is tennis, and the mean of FAVSPORTWATCH cannot be calculated.
The mode of FAVSPORTWATCH is 21, and the mean of FAVSPORTWATCH is 15.
The mode of FAVSPORTWATCH is 21, and the mean of FAVSPORTWATCH cannot be calculated.

Command: TALLY COUNTS AND PERCENTS
Statistics > Summaries > Frequency Distributions
Variables (pick one or more): FAVSPORTWATCH
Apply, OK

Output

The most commonly occurring sport is tennis, with a count of 21. Since the data is categorical, the mean (a number) cannot be calculated.

Question 7 C3 Q1 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column FAAGESTUDBIRTH (a variable that measures father's age at student birth in years) and the column BAORBS (a variable that measures whether the student is pursuing a BA or a BS). Indicate which of the following statements is most correct.

Answer
The BA group has 2 outliers and the BS group has 3 outliers.
The outlier FAAGESTUDBIRTH data values in the BA group are 29 years and 37 years.
The outlier FAAGESTUDBIRTH data values in the BS group are 32 years, 27 years, and 2 years.
The range for the BA group is wider than the range for the BS group.

SIDE BY SIDE BOXPLOTS
Command: Graphs > Boxplot
Variable (pick one): FAAGESTUDBIRTH
Plot by Groups: BAORBS
OK
Options:
x-axis label: Degree
y-axis label: Age
Graph Title: ……………… .
Apply, OK

Output

(Counting the outlier points on the boxplots gives 3 for BS and 2 for BA.)
Question 8 C3 Q8 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column MOREAD, which is a variable that measures the age (in months) at which students first began to read on their own. Use R to determine the 5 number summary of this column data. Which of the following statements is incorrect?

Answer
75% of this data is above 50.75
25% of this data is below 50.75
25% of this data is above 60.00
75% of this data is above 60.00

Command: Statistics > Summaries > Numerical Summaries
Variables (pick one or more): MOREAD
Statistics: Leave default checks in Mean, Standard Deviation, Interquartile Range, and Quantiles
Apply, OK

Output

Q1 = 50.75, so 25% of the data lies below 50.75 and 75% of the data lies above 50.75.
Q3 = 60.00, so 75% of the data lies below 60.00 and 25% of the data lies above 60.00.

Question 9 C3 Q7 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column MOREAD, which is a variable that measures the age (in months) at which students first began to read on their own. Use R to find the mean, standard deviation, 5 number summary, and the number of observations for this data. Which of the following statements is incorrect?

Answer
The range of this data is 31 months.
The mean for this data is 55.03333 months and the standard deviation for this data is 7.147280 months.
The number of observations for this data is 60.
25% of the data in this data set lies between 50.75 months and 60.00 months.

Command: Statistics > Summaries > Numerical Summaries
Variables (pick one or more): MOREAD
Statistics: Leave default checks in Mean, Standard Deviation, Interquartile Range, and Quantiles
Apply, OK

Output

Q1 = 50.75 and Q2 (median) = 55, so 25% of the data lies between 50.75 and 55. The last statement is incorrect: with Q3 = 60, 50% of the data actually lies between 50.75 and 60.
From the dotplot, we can see 3 peaks at 54, 55 and 60.
The mean and standard deviation are 55.03333 and 7.14728, respectively.
The range of the data is 67 - 36 = 31 months.
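The quartile reasoning in Questions 8 and 9 can be checked with a few lines of arithmetic. The sketch below is in Python rather than R and uses only the summary values reported above (minimum 36, Q1 50.75, median 55, Q3 60.00, maximum 67); the helper `pct_between` is a made-up name for illustration.

```python
# Five-number summary values for MOREAD (months) as reported in the source.
minimum, q1, median, q3, maximum = 36, 50.75, 55, 60.00, 67

data_range = maximum - minimum   # 67 - 36
iqr = q3 - q1                    # width of the middle half of the data

# By definition, each quartile boundary cuts off another 25% of the data.
pct_below = {"Q1": 25, "median": 50, "Q3": 75}

def pct_between(lower, upper):
    """Percent of the data lying between two quartile boundaries."""
    return pct_below[upper] - pct_below[lower]

print(data_range, iqr)
print(pct_between("Q1", "median"), pct_between("Q1", "Q3"))
```

The second line of output is the point of Question 9: the span from Q1 to the median holds 25% of the data, while the span from Q1 to Q3 holds 50%.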
Question 10 C3 Q4 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column ENDPULSEMIN (a variable that measures student pulse in beats per min (bpm) after doing the survey) and the column UNDERGORGRAD (a variable that measures whether the student is pursuing an undergraduate or graduate degree). Create side by side boxplots to describe this data. Choose the most correct answer below.

Answer
The interquartile range for the graduate/professional data is smaller than the interquartile range for the undergraduate data, while the percent of data in the interquartile range of the graduate/professional data is the same as the percent of data in the interquartile range of the undergraduate data.
The interquartile range for the graduate/professional data is smaller than the interquartile range for the undergraduate data, while the percent of data in the interquartile range of the graduate/professional data is higher than the percent of data in the interquartile range of the undergraduate data.
The interquartile range for the graduate/professional data is smaller than the interquartile range for the undergraduate data, while the percent of data in the interquartile range of the graduate/professional data is lower than the percent of data in the interquartile range of the undergraduate data.
The interquartile range for the graduate/professional data is larger than the interquartile range for the undergraduate data, while the percent of data in the interquartile range of the graduate/professional data is the same as the percent of data in the interquartile range of the undergraduate data.

SIDE BY SIDE BOXPLOTS
Command: Graphs > Boxplots
Variable (pick one): ENDPULSEMIN
Plot by Groups: UNDERGORGRAD
OK
Options:
x-axis label: Degree
y-axis label: Pulse rate
Graph Title: ……………… .

Output

REGARDLESS OF THE VALUE OF THE IQR, 50% OF THE DATA LIES BETWEEN Q1 AND Q3.
WE CAN SEE FROM THE BOXPLOTS THAT THE IQR IS SMALLER IN VALUE FOR THE GRADUATE/PROFESSIONAL DATA THAN FOR THE UNDERGRADUATE DATA.
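The point that the interquartile range always brackets about half of the data, however wide it is, can be illustrated with a short Python sketch. The two samples below are hypothetical evenly spaced values, not the survey data, and `share_inside_iqr` is a made-up helper name; quartiles are computed with the standard library's `statistics.quantiles`, which may differ slightly from R's defaults.

```python
import statistics

def share_inside_iqr(values):
    """Return (IQR, fraction of values lying from Q1 to Q3 inclusive)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    inside = sum(q1 <= v <= q3 for v in values)
    return q3 - q1, inside / len(values)

# Hypothetical pulse-like samples (NOT the survey data): same centre, different spread.
narrow = [70 + 0.1 * i for i in range(101)]   # 70.0 up to 80.0
wide = [50 + 0.5 * i for i in range(101)]     # 50.0 up to 100.0

iqr_narrow, share_narrow = share_inside_iqr(narrow)
iqr_wide, share_wide = share_inside_iqr(wide)

print(iqr_narrow, share_narrow)
print(iqr_wide, share_wide)
```

The IQR values differ by a factor of about five, yet the share of observations between Q1 and Q3 is close to one half in both samples, which is why only the width of the box, not the percent of data inside it, distinguishes the two groups.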
Question 11 C3 Q5 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column ENDPULSEMIN (a variable that measures student pulse in beats per min (bpm) after doing the survey) and the column UNDERGORGRAD (a variable that measures whether the student is pursuing an undergraduate or graduate degree). Use R to find the mean and standard deviation of student pulse rate in beats per minute for the undergrad students and the mean and standard deviation of student pulse rate in beats per minute for the graduate students. Choose the most correct answer below.

Answer
The undergraduate mean is more than the graduate mean and the undergraduate standard deviation is less than the graduate standard deviation.
The undergraduate mean is more than the graduate mean and the undergraduate standard deviation is more than the graduate standard deviation.
The undergraduate mean is less than the graduate mean and the undergraduate standard deviation is less than the graduate standard deviation.
The undergraduate mean is less than the graduate mean and the undergraduate standard deviation is more than the graduate standard deviation.

DESCRIPTIVE STATISTICS FOR A NUMERICAL VARIABLE FOR EACH OF 2 OUTCOMES OF A CATEGORICAL VARIABLE
Command: Statistics > Summaries > Numerical Summaries
Variables (pick one or more): ENDPULSEMIN
Summarize by Groups: Groups variable (pick one): UNDERGORGRAD, OK
Statistics: Leave default checks in Mean, Standard Deviation, Interquartile Range, and Quantiles
Apply, OK

Output
The undergraduate mean of 76.18182 is more than (larger than) the graduate mean of 74.44444, and the undergraduate standard deviation of 6.530958 is more than (larger than) the graduate standard deviation of 5.451417, so the second offered answer is correct.
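The comparison for Question 11 can be made mechanical. The Python sketch below takes only the four group summaries reported above and builds the matching answer statement; `compare` is a made-up helper for illustration.

```python
# Group summaries for ENDPULSEMIN (bpm) as reported in the source output.
undergrad = {"mean": 76.18182, "sd": 6.530958}
graduate = {"mean": 74.44444, "sd": 5.451417}

def compare(a, b):
    """Describe a relative to b as 'more than', 'less than' or 'equal to'."""
    if a > b:
        return "more than"
    if a < b:
        return "less than"
    return "equal to"

statement = (
    f"The undergraduate mean is {compare(undergrad['mean'], graduate['mean'])} "
    f"the graduate mean and the undergraduate standard deviation is "
    f"{compare(undergrad['sd'], graduate['sd'])} the graduate standard deviation."
)
print(statement)
```

Matching the printed sentence against the offered answers picks out the option in which both the mean and the standard deviation are larger for undergraduates.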
Question 12 C3 Q6 V1: The survey data STATISTICSSTUDENTSSURVEYFORR contains the column BEFBREATHMIN, which is a variable that measures the number of breaths taken by students before completing a survey, and the column ENDBREATHMIN, which is a variable that measures the number of breaths taken by students after completing a survey. Use R to make an appropriate graph to describe the data. Indicate which of the following statements is most correct.

Answer
The before-survey breaths per minute and the end-survey breaths per minute increase together for this data.
As the before-survey breaths per minute increase, the end-survey breaths per minute decrease.
As the end-survey breaths per minute increase, the before-survey breaths per minute decrease.
As the before-survey breaths per minute increase, the end-survey breaths per minute do not change.

SCATTERPLOT OF TWO NUMERICAL VARIABLES
Command: Graphs > Scatterplot
x-variable (pick one): BEFBREATHMIN
y-variable (pick one): ENDBREATHMIN
Apply, OK

Output