Chapter 12- Quantitative Data Analysis

rtf

School

Florida Atlantic University *

*We aren’t endorsed by this school

Course

4704

Subject

Statistics

Date

Apr 3, 2024

Type

rtf

Pages

14

Uploaded by PresidentQuailPerson763

Report
1. Which of the following most accurately describes Singleton's study of alcohol consumption and academic performance? A) It involved the secondary analysis of a national survey. B) It was based on a telephone interview survey. C) It measured academic performance in terms of self-reported GPA. D) It combined data from four campus surveys. 2. What is the order of steps in the quantitative analysis of survey data? A) inspect/modify data data processing bivariate analysis multivariate testing B) inspect/modify data data processing multivariate testing bivariate analysis C) data processing inspect/modify data bivariate analysis multivariate testing D) data processing inspect/modify data multivariate testing bivariate analysis 3. Editing of survey data A) involves checking for inconsistencies and omitted responses. B) is carried out prior to the process of data collection. C) is applied mostly to computer-assisted interviewing surveys. D) is the sole responsibility of the project supervisor. 4. In a data matrix, __________ are placed in rows and __________ are placed in columns. A) variables; missing data B) cases or observations; variables C) dependent variables; independent variables D) independent variables, dependent variables 5. Codebooks may contain all but which one of the following? A) raw survey data B) numerical codes for each response C) question wording D) editing and coding rules E) interviewer directions 6. Obtaining frequency distributions for all the variables in a data file is one way to A) do wild-code checking. B) do consistency checking. C) verify data entry. Page 1
D) edit the data. 7. What is the usual order of steps in processing completed survey interviews or questionnaires? A) data entry coding editing cleaning B) editing coding data entry cleaning C) cleaning coding data entry editing D) coding data entry cleaning editing 8. In terms of data processing, one advantage of computer-assisted interviewing over paper-and-pencil questionnaire surveys is that A) it is easier to determine if interviewers are recording answers accurately and adequately. B) there is no need to code responses. C) open-ended questions can be coded more easily. D) data entry occurs directly when interviewers record respondents' answers. 9. Percentage distributions A) may be applied only to interval-/ratio-scale variables. B) should include missing values in the computation of percentages. C) cannot be computed when there are missing data. D) provide an explicit comparative framework for interpreting distributions. 10. Univariate distributions of interval-/ratio-scale variables include all but which one of the following properties? A) regression B) central tendency C) dispersion D) shape 11. If the median in a distribution is 75, this means that A) 75 percent of the cases scored above the median. B) a score of 75 has the highest frequency. C) 75 is average score. D) a score of 75 divides the frequency distribution in half. 12. What is the mode in the following set of data? 1, 2, 2, 3, 5, 6, 9 A) 1 B) 2 C) 3 Page 2
D) 4 E) 5 13. In the campus survey, what was the shape of the distribution of reported number of drinks consumed on a typical weekend night? A) normal. B) abnormal. C) positively skewed. D) negatively skewed. 14. Which of the following methods is not an option for handling missing data? A) index construction B) listwise deletion C) recoding D) imputation 15. What are the marginals in a cross-tabulation or contingency table? A) outliers B) standard deviates C) cell frequencies D) lowest and highest frequencies E) row and column totals 16. To analyze the relationship in a contingency table, the rule for calculating percentages is to compute percentages based on the A) total number of cases in the table. B) number of cases in each category of the dependent variable. C) number of cases in each category of the independent variable. D) column variable, regardless of whether it is independent or dependent. 17. Consider the following table from the 2016 GSS, which shows the relationship between race and whether someone favors or opposes “a law which would require a person to obtain a police permit before he or she could buy a gun.” The data in this table suggest that (the answer may require some calculation) A) there is a near-zero association between race and support for gun control. B) whites are more likely to favor gun control than blacks. Page 3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
C) blacks are more likely to favor gun control than whites. 18. Suppose a researcher finds a statistically significant relationship between salary and job satisfaction among a random sample of employees. From this information, he can A) reject the null hypothesis that there is no relationship between job satisfaction and salary. B) conclude that differences in salary cause differences in job satisfaction. C) conclude that the relationship between salary and job satisfaction probably occurred at random. D) infer that salary is the most important factor in job satisfaction. 19. The chi-square test for independence indicates A) how two variables are related to one another. B) whether a relationship exists between variables. C) the strength of the relationship between variables. D) the direction of the relationship between variables. 20. In the general formula for a linear relationship, Y = a + bX, “a” is called the A) least squares point. B) Y-intercept. C) regression coefficient. D) slope. 21. For the 2016 GSS, you regress number of hours of television watched on the average day (Y) on number of years of education completed (X) and obtain the following result: Y = 5.37 – .18X. How much change in hours of television watched is associated with a change of one year in a respondent's education? A) 1 B) 5.37 – .18 C) 5.37 D) –.18 22. Consider the regression equation Y = –5.43 + 4.16X. This equation tells us that A) one unit increase in X is associated with a 5.43 unit decrease in Y. B) one unit increase in X is associated with a 4.16 unit increase in Y. C) one unit increase in Y is associated with a 5.43 decrease in X. D) one unit increase in Y is associated with a 4.16 increase in X Page 4
23. A correlation of –.85 indicates a __________ relationship, and a correlation of +.10 indicates a __________ relationship. A) strong; weak B) weak; strong C) weak; moderate D) weak; weak 24. The difference between an actual score and the score predicted by the regression equation is called A) a slope. B) the explained variation. C) a residual. D) a regression coefficient. 25. Suppose two variables are negatively related. Which of the following regression equations might describe this relationship? A) Y = 3.21 + 2.41X B) Y = –.45 + 4.12X C) Y = 18.62 – 1.21X 26. Which of the following is an example of an inferential statistic ? A) range B) mean C) correlation coefficient D) chi-square test for independence 27. Statistical control of extraneous variables in a survey is analogous to __________ in an experiment. A) randomly assigning research participants to experimental conditions B) manipulating the independent variable C) pretesting (i.e., premeasuring) the dependent variable D) posttesting the dependent variable. 28. According to the theoretical model: A) the inferred causal relationship between Y and Z is weak. B) X is an intervening variable. C) X affects Y and Z, but there is no causal relationship between Y and Z. D) Y and Z are independent of X. Page 5
29. Which of the following is not typically a step in elaboration analysis? A) examine the bivariate relationship between X and Y B) consider extraneous variables that might affect the original X – Y relationship C) control statistically for the influence of extraneous variables D) simultaneously control for several additional extraneous variables 30. Suppose you examine the relationship between participation in interscholastic sports (yes/no) and grade-point average (low, medium, high). Controlling for the variable sex (male/female) would require the creation of __________ partial tables. A) one B) two C) three D) four E) five 31. Which variable(s) is (are) controlled, or held constant, in each partial table of Table 1? Table 1. Percentage of Respondents Expressing High Tolerance of Civil Liberties for Political Dissidents, by Gender and Religiosity A) gender B) religiosity C) gender and religiosity D) tolerance of civil liberties for political dissidents 32. In elaboration analysis, two variables are spuriously related if the original bivariate relationship __________ when an extraneous __________ variable is controlled. A) remains the same; intervening B) remains the same; antecedent C) disappears; intervening D) disappears; antecedent Page 6
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
33. A partial-regression coefficient in a multiple regression equation A) is the result of a specification error. B) usually has the same value as the corresponding bivariate regression coefficient. C) indicates the influence of an independent variable on the dependent variable when other independent variables are held constant. D) is a product of multicollinearity. 34. (Box 12.3) Which of the following would not pose a problem for the interpretation of a multiple linear regression analysis? A) nominal-scale independent variables B) multicollinearity C) misspecification D) curvilinear relationship between independent and dependent variable 35. A dummy variable in a multiple regression analysis is A) an unmeasured variable that is excluded from the underlying causal model. B) an uncontrolled variable that creates a spurious relationship. C) an uncontrolled variable that intervenes or “mediates” a relationship. D) an interval- or ratio-scale variable with a mean of 0 and a standard deviation of 1. E) a nominal- or ordinal-scale variable that is recoded to have the values of 0 and 1. 36. Using the 2016 GSS, we regressed the number of hours of television watched on the average day on three variables: years of education, age, and marital status. Marital status is a dummy variables with 1 = married. We get the following results for the unstandardized regression coefficients : TVhours = 3.52 – .16Educ + .04Age – .69Married. According to this equation, A) There is no association between age and television viewing. B) Younger people watch more hours of television, on average, than older people. C) Married people watch fewer hours of television, on average, than unmarried people. D) Marital status is more strongly associated with hours of television viewing than age. 37. Using the 2016 GSS, we regressed the number of hours of television watched on the average day on three variables: years of education, age, and marital status. Marital status is a dummy variables with 1 = married. We get the following results for the unstandardized regression coefficients : TVhours = 3.52 – .16Educ + .04Age – .69Married. What is the predicted number of hours of television viewing for a 20-year old, unmarried person with 14 years of education? A) 1.39 B) 2.08 C) 2.71 Page 7
D) 4.41 38. Using the 2016 GSS, we regressed the number of hours of television watched on the average day on three variables: years of education, age, and marital status. Marital status is a dummy variables with 1 = married. We get the following results for the standardized regression coefficients : TVhours = – .17Educ + .27Age – .13Married. According to this equation, A) none of the variables is significantly related to television viewing. B) all three variables are significantly related to television viewing. C) age has the strongest association with television viewing. D) one year of education has a greater impact on viewing than one year of age. 39. Closed-ended questions usually are coded before data collection. A) True B) False 40. Editing is carried out after all the data have been entered into a data file. A) True B) False 41. Some data-processing activities can be programmed into computer-assisted interviewing. A) True B) False 42. Using computer-assisted interviewing eliminates the need for coding and data cleaning. A) True B) False 43. Verification may involve entering the data twice into separate files and then comparing the two files for noncomparable entries. A) True B) False 44. Consistency checking compares entries in a data file with entries in the interview schedule or questionnaire. A) True B) False Page 8
45. Calculations in a percentage distribution usually are based on the total number of responses, including those coded “don't know” and “not applicable.” A) True B) False 46. The mean is a statistical property of the distribution of a nominal-scale variable. A) True B) False 47. Univariate analysis can determine whether to recode variable categories for further analysis. A) True B) False 48. Outliers are unusual or suspicious values that are far removed from the preponderance of observations for a variable. A) True B) False 49. Listwise deletion is the best method of handling missing values, regardless of the number of missing cases. A) True B) False 50. Bivariate distributions may be constructed for variables with nominal and ordinal as well as interval and ratio measurement. A) True B) False 51. In a cross-tabulation, the row totals and the column totals each describe univariate distributions. A) True B) False 52. To interpret the relationship between variables in a contingency table, the rule is “percentage across, read across; percentage down, read down.” A) True Page 9
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
B) False 53. Tests of statistical significance may be applied only to interval- and ratio-scale variables. A) True B) False 54. The chi-square test is a measure of degree of association. A) True B) False 55. Examining a scatterplot can reveal whether a linear regression analysis is appropriate. A) True B) False 56. Regression coefficients indicate, among other things, the direction of the relationship between two variables. A) True B) False 57. The correlation coefficient measures the direction and strength of association between variables. A) True B) False 58. In the model X T Y, the relationship between X and Y is spurious. A) True B) False 59. In elaboration analysis, statistically controlling for an intervening variable may reveal a spurious association. A) True B) False 60. One strength of elaboration analysis is that it permits the simultaneous control of several independent variables. A) True Page 10
B) False 61. Leaving out important variables from a model is called a “specification error.” (Box 12.3) A) True B) False 62. “Collinearity” refers to a perfect association between variables. (Box 12.3) A) True B) False 63. Multiple regression is limited to the analysis of interval- and ratio-scale independent variables. A) True B) False 64. A dummy variable is a dichotomous variable with variable categories arbitrarily coded 0 and 1. A) True B) False 65. To determine which independent variable has the greatest impact in a multiple regression, researchers use unstandardized regression coefficients. A) True B) False 66. The quality of data is affected at several stages of social research, including data processing. What techniques do survey researchers apply to avoid errors and enhance data quality during data processing? Are data processing errors unavoidable, like random sampling error? Explain. 67. Describe the differences in the univariate analysis of nominal/ordinal variables and interval/ratio variables. What descriptive statistics are used to describe each type of variable? 68. Describe the differences in the bivariate analysis of nominal/ordinal variables and interval/ratio variables. What descriptive and inferential statistics are used to describe each type of variable? Page 11
69. Someone at your college conducts a survey on helping and voluntarism. Suppose you are consulted about how to analyze the data to test the hypothesis that students majoring in the arts are more likely to do volunteer work than students majoring in the sciences. (a) What questions would you ask about the data before you make your recommendations? (b) As you might point out, why is a bivariate analysis seldom, if ever, sufficient to test a hypothesis that one variable causes another? (c) As you might explain, how is multiple regression superior to elaboration? Page 12
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Answer Key 1. D 2. C 3. A 4. B 5. A 6. A 7. B 8. D 9. D 10. A 11. D 12. B 13. C 14. A 15. E 16. C 17. C 18. A 19. B 20. B 21. D 22. B 23. A 24. C 25. C 26. D 27. A 28. C 29. D 30. B 31. B 32. D 33. C 34. A 35. E 36. C 37. B 38. C 39. A 40. B 41. A 42. B 43. A 44. B Page 13
45. B 46. B 47. A 48. A 49. B 50. A 51. A 52. B 53. B 54. B 55. A 56. A 57. A 58. B 59. B 60. B 61. A 62. A 63. B 64. A 65. B 66. 67. 68. 69. Page 14