Lab1_Sampling Methods and Bias_ Final (1)

School

Purdue University

Course

301

Subject

Statistics

Date

Apr 3, 2024

Type

docx

Pages

3

Uploaded by ColonelElk3886

Name:
TA name/class time:
Lecturer name:

Lab 1: Sampling Methods/Bias

NOTE: SPSS outputs are necessary to show full completion of the lab. Please paste all SPSS outputs into your lab report and submit the completed report, including all requested tables and graphs, via Brightspace (under the "Lab" folder) by 11:59 pm Friday. Two points will be deducted for each requested SPSS output that is not included in the submitted lab document. Also, 30% of the points will be deducted for late submissions, up to 24 hours.

Dataset: This lab uses the dataset SleepPatterns, located on Brightspace under Lab in the Datasets submodule. Instructions for opening the dataset in SPSS are as follows.

SPSS installed on a computer: Reference page 4 of the SPSS Instruction Manual.
SPSS running remotely: Reference the slide “Opening your Dataset Remotely in SPSS via Go Remote” in the document “SPSS using Citrix access guidelines” on Brightspace.

One hundred fifty college students in Indiana participated in a study examining the associations among sleep habits, sleep quality, and physical/emotional factors. Participants completed an online survey about sleep habits that included the Pittsburgh Sleep Quality Index (PSQI), the Epworth Sleepiness Scale (ESS), the Horne-Ostberg Morningness Eveningness Scale (MES), the Subjective Units of Distress Scale (SUDS), and questions about academic performance and physical health.

1. (2 points) Form a simple random sample (SRS) of 5 subjects from the dataset with SPSS using page 6 of the SPSS Instruction Manual. Once the sample is formed, record the Height for each selected subject in table SRS #1 below. Next, compute the sample mean of Height in one of the two ways below and record it in the corresponding entry of Table 1.

   1) Calculator: Add the five numbers and then divide by 5.
   2) SPSS: Reference the SPSS Instruction Manual page 10, "Descriptive Statistics: Calculate the Mean". Make sure to move the variable Height to the variable box.

SRS #1
Subject:   1      2      3      4      5
Height:

2. (6 points) Repeat question 1 three more times to form SRS #2, SRS #3, and SRS #4, recording the Height values and the sample mean for each. You do not need to re-open the dataset each time. Simply repeat the sampling process from the previous question to obtain a new random sample.

SRS #2
Subject:   1      2      3      4      5
Height:

SRS #3
Subject:   1      2      3      4      5
Height:

SRS #4
Subject:   1      2      3      4      5
Height:

Fill in the table below with the mean values calculated for each sample:

Table 1: Height SRS Means
Sample:            SRS #1    SRS #2    SRS #3    SRS #4
Sample mean (x̄):

Do the following before proceeding with the lab. Navigate to “Data -> Select Cases”, select “All Cases”, and click OK. This should bring back the original dataset.

Pause for a moment and reflect on the following statements. This will help you understand what you have done and prepare you for the rest of the lab.

Consider the sample means in Table 1. Are they all the same? Most likely not, because sample statistics such as the sample mean vary from sample to sample. Furthermore, with a small sample size like those above, the chosen individuals are more likely to differ from sample to sample, which leads to greater variability. The fact that variability is reduced by increasing the sample size is a concept used all semester.

In general, the value of a population parameter is not known because the information needed to find it cannot be measured for every individual in the population. However, the parameter could be found directly if all the needed information were known.

Continue with the lab. In questions 3 and 4, consider the dataset of 150 subjects as if it were a fictitious “population” of interest.

3. (2 points) Using SPSS, calculate the mean Height for this fictitious “population”. Write the value below to 3 decimal places and include the output from SPSS here. Reference the SPSS Instruction Manual pages 10 (mean) and 5 (copying output).

4. a) (1 point) In your opinion, are the sample means in Table 1 similar to or different from the mean of the fictitious “population”? Was variability due to small sample size apparent?
b) (1 point) Explain how increasing the sample size would affect how the sample means compare to the mean of the fictitious “population”.

For the remainder of the lab, again consider the dataset as a sample of the real population of interest.

5. (1 point) The subjects in the dataset answered a call for volunteers to participate in this study. Assuming every subject who volunteered has a recorded response for every variable, what type of bias could possibly result from this sampling design?

6. (2 points) What is an alternative sampling design that could avoid the bias mentioned in question #5? Explain how it would address the bias.

7. (1 point) The variables GPA and Height were measured through self-reporting. Describe the bias that could possibly result.

8. (2 points) How could you avoid the type of bias mentioned in question #7? Give at least one idea for each variable.

9. a) (1 point) Consider the following scenario: Subjects were asked to volunteer for this study and were told they would be paid for their time spent completing a questionnaire. As part of the questionnaire, the subjects provided their own average hours slept per night. Also, some of the questions on the questionnaire asked personal questions that several subjects refused to answer. Given this information, do you think that the data from the questionnaire will provide an accurate reflection of all subjects’ average hours slept per night?

b) (1 point) If so, explain why. If not, identify at least two flaws in the sampling method of the study.
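
As a rough illustration outside SPSS, the sampling steps in questions 1 through 4 can be sketched in Python. This is not part of the lab and does not replace the required SPSS output. The heights below are randomly generated stand-ins, not the SleepPatterns data; the function names, the seeds, and the assumed mean of 67 inches with a standard deviation of 4 inches are all made up for this sketch.

```python
import random
import statistics


def make_fake_heights(n=150, seed=0):
    """Hypothetical stand-in for the Height column (inches).

    These values are simulated; they are NOT the SleepPatterns data.
    """
    rng = random.Random(seed)
    return [round(rng.gauss(67.0, 4.0), 1) for _ in range(n)]


def srs_mean(population, size, rng):
    """Draw one simple random sample without replacement; return (sample, mean)."""
    sample = rng.sample(population, size)
    return sample, statistics.mean(sample)


def spread_of_sample_means(population, size, repeats, rng):
    """Standard deviation of the sample mean over many repeated SRSs."""
    means = [srs_mean(population, size, rng)[1] for _ in range(repeats)]
    return statistics.stdev(means)


heights = make_fake_heights()
rng = random.Random(301)  # arbitrary seed so the sketch is repeatable

# Questions 1-2: four SRSs of 5 subjects each, with their sample means.
table1 = [srs_mean(heights, 5, rng) for _ in range(4)]
for i, (sample, mean) in enumerate(table1, start=1):
    print(f"SRS #{i}: heights={sample}  mean={mean:.3f}")

# Question 3: mean of the fictitious "population" of 150 subjects.
population_mean = statistics.mean(heights)
print(f"population mean = {population_mean:.3f}")

# Question 4: compare how much sample means vary at two sample sizes.
spread_n5 = spread_of_sample_means(heights, 5, 2000, rng)
spread_n50 = spread_of_sample_means(heights, 50, 2000, rng)
print(f"SD of sample means, n=5:  {spread_n5:.3f}")
print(f"SD of sample means, n=50: {spread_n50:.3f}")
```

With samples of size 5 the means scatter noticeably around the population mean, while at size 50 the scatter is smaller. That is the pattern question 4 b) asks you to explain in your own words.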