Concept explainers
In addition to the key words, you should also be able to define each of the following terms:
Linear relationship
Pearson
Monotonic relationship
Spearman correlation
Statistical significance of a
Regression
Third-variable problem
Directionality problem
Multiple regression
data:image/s3,"s3://crabby-images/2698b/2698b129880c27e76a91019c9f73226195062b2d" alt="Check Mark"
To define:
Each of the following terms: Scatter plot, linear relationships, Pearson correlation, monotonic relationship, Spearman correlation, Statistical significance of a correlation, regression, third variable problem, directionality problem, multiple regressions.
Explanation of Solution
Explanations:
Scatter Plot: It is a 2D graph between 2 variables X and Y obtained by plotting X on horizontal and Y on vertical axes. The scatter plot is mainly done to study the extent of correlation between 2 variables. If a large correlation exists then the points scatter in a line and if there's no such correlation then they are scattered randomly.
Linear Relationship: In a linear relationship the relationship between 2 variables can be represented with a line. The linear relationship can be positive or negative depending on the fact that if X increase then Y increases too and it is positive. It is negative when X increases but Y decreases. Linear relationships can be determined using scatter plots between 2 variables. There ca be no relationship too in that case the points scattered randomly. Graphical representation of positive linear relation:
Pearson correlation:
Pearson product correlation coefficient is a measure of linear association between 2 variables X and Y. It has a value between -1 and +1 and is denoted by r.
r consists of a ratio comparing the covariance( X and Y) (numerator) with the variance of X and Y separately (in the denominator).
The formula:
Monotonic relationship: It is the degree to which the relationship is one directional. So as one value increases the other also increases and as one decreases other also decreases. Spearman correlation measures this value.
Spearman Correlation: This is distribution free alternative to Pearson r value. It measures strength and direction of monotonic relationship and used mostly for ordinal observations.
The formula is:
In this case ranks are calculated for each data X and Y and their difference is taken represented as d in the formula. It can also take values in (-1,+1)range. A high positive value like 0.9 indicates a strong positive correlation and vice versa. So here as X increases so does Y (Just like r).
Statistical Significance of a correlation: Correlation is tested using t statistic.
The formula is provided as
to test whether the linear relationship is strong enough to model the relationship in the population. The hypothesis tests decide if the population correlation
Regression: It is a set of statistical processes for establishing or estimating the relationship among variables. Regression analysis helps to understand how changes in independent variable results in changes in dependent variable. It is widely used for prediction and forecasting. It helps in understanding which independent variables are related to the dependent variable and what is the form of the relationship. In other words it can be used to establish causal relationship.
Third variable problem: In context of experimental design third variable can be confounded variable resulting in relationship between 2 variables. Example can be cities with higher churches have higher crime rate but more churches do not lead to more crime so a 3rd variable population lead to more churches and more crime.
Directionality problem: It's a problem with 2 variables when the cause and effect is not known. It's known that there exists a relationship between X and Y however whether X is due to Y or vice versa is not known. For that apart from correlational study experimental study are required to be conducted. So an example can be there's a strong correlation between amount of violence seen on Tv and amount of aggressive behavior by children but its not known whether amount of violence seen on TV is due to aggressive behavior or vice versa.
Multiple regressions: The general purpose of the multiple regressions is to establish a relationship between several predictor variables and a dependent variable. Notation ally if Y is a dependent variable and Xs are several predictors then f is a function or line of least square which is estimated in multiple regressions.
It is widely used in social and natural sciences. Educators might be interested in predictors for success in high school or psychologist might be interested in personality best predicting social adjustments.
For example Price of the house sold can be dependent variable and size of house, number of bedrooms, locality, average income in respective neighborhoods, and appeal of the house can be predictors. The other example can be Salary being dependent and amount of responsibility, number of people to supervise etc. are independent.
Want to see more full solutions like this?
Chapter 12 Solutions
MINDTAP PSYCHOLOGY FOR GRAVETTER/FORZAN
- Exercise 6-6 (Algo) (LO6-3) The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.arrow_forward1. Find the mean of the x-values (x-bar) and the mean of the y-values (y-bar) and write/label each here: 2. Label the second row in the table using proper notation; then, complete the table. In the fifth and sixth columns, show the 'products' of what you're multiplying, as well as the answers. X y x minus x-bar y minus y-bar (x minus x-bar)(y minus y-bar) (x minus x-bar)^2 xy 16 20 34 4-2 5 2 3. Write the sums that represents Sxx and Sxy in the table, at the bottom of their respective columns. 4. Find the slope of the Regression line: bi = (simplify your answer) 5. Find the y-intercept of the Regression line, and then write the equation of the Regression line. Show your work. Then, BOX your final answer. Express your line as "y-hat equals...arrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Generate the log of birthweight and family income of children. Name these new variables Ibwght & Ifaminc. Include the output of this code. ii. Apply the command sum with the detail option to the variable faminc. Note: you should find the 25th percentile value, the 50th percentile and the 75th percentile value of faminc from the output - you will need it to answer the next question Include the output of this code. iii. iv. Use the output from part ii of this question to Generate a variable called "high_faminc" that takes a value 1 if faminc is less than or equal to the 25th percentile, it takes the value 2 if faminc is greater than 25th percentile but less than or equal to the 50th percentile, it takes the value 3 if faminc is greater than 50th percentile but less than or equal to the 75th percentile, it takes the value 4 if faminc is greater than the 75th percentile. Include the outcome of this code…arrow_forward
- solve this on paperarrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Apply the command egen to create a variable called "wyd" which is the rowtotal function on variables bwght & faminc. ii. Apply the list command for the first 10 observations to show that the code in part i worked. Include the outcome of this code iii. Apply the egen command to create a new variable called "bwghtsum" using the sum function on variable bwght by the variable high_faminc (Note: need to apply the bysort' statement) iv. Apply the "by high_faminc" statement to find the V. descriptive statistics of bwght and bwghtsum Include the output of this code. Why is there a difference between the standard deviations of bwght and bwghtsum from part iv of this question?arrow_forwardAccording to a health information website, the distribution of adults’ diastolic blood pressure (in millimeters of mercury, mmHg) can be modeled by a normal distribution with mean 70 mmHg and standard deviation 20 mmHg. b. Above what diastolic pressure would classify someone in the highest 1% of blood pressures? Show all calculations used.arrow_forward
- Write STATA codes which will generate the outcomes in the questions & submit the output for each question only when indicated below i. ii. iii. iv. V. Write a code which will allow STATA to go to your favorite folder to access your files. Load the birthweight1.dta dataset from your favorite folder and save it under a different filename to protect data integrity. Call the new dataset babywt.dta (make sure to use the replace option). Verify that it contains 2,998 observations and 8 variables. Include the output of this code. Are there missing observations for variable(s) for the variables called bwght, faminc, cigs? How would you know? (You may use more than one code to show your answer(s)) Include the output of your code (s). Write the definitions of these variables: bwght, faminc, male, white, motheduc,cigs; which of these variables are categorical? [Hint: use the labels of the variables & the browse command] Who is this dataset about? Who can use this dataset to answer what kind of…arrow_forwardApply STATA commands & submit the output for each question only when indicated below İ. ii. iii. iv. V. Apply the command summarize on variables bwght and faminc. What is the average birthweight of babies and family income of the respondents? Include the output of this code. Apply the tab command on the variable called male. How many of the babies and what share of babies are male? Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are white. Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are male but not white. Include the output of this code. Using your answers to previous subparts of this question: What is the difference between the average birthweight of a baby who is male and a baby who is male but not white? What can you say anything about the difference in family income of the babies that are male and male…arrow_forwardA public health researcher is studying the impacts of nudge marketing techniques on shoppers vegetablesarrow_forward
- The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.arrow_forwardA pollster randomly selected four of 10 available people. Required: How many different groups of 4 are possible? What is the probability that a person is a member of a group? Note: Round your answer to 3 decimal places.arrow_forwardWind Mountain is an archaeological study area located in southwestern New Mexico. Potsherds are broken pieces of prehistoric Native American clay vessels. One type of painted ceramic vessel is called Mimbres classic black-on-white. At three different sites the number of such sherds was counted in local dwelling excavations. Test given. Site I Site II Site III 63 19 60 43 34 21 23 49 51 48 11 15 16 46 26 20 31 Find .arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtCollege AlgebraAlgebraISBN:9781305115545Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage Learning
- Algebra and Trigonometry (MindTap Course List)AlgebraISBN:9781305071742Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage LearningHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALFunctions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage Learning
data:image/s3,"s3://crabby-images/b9e14/b9e141b888912793d57db61a53fa701d5defdb09" alt="Text book image"
data:image/s3,"s3://crabby-images/af711/af7111c99977ff8ffecac4d71f474692077dfd4c" alt="Text book image"
data:image/s3,"s3://crabby-images/70f5c/70f5cef52227d3e827c226418ce33af96e43372d" alt="Text book image"
data:image/s3,"s3://crabby-images/86990/869902122cc988a8b1078ef9afcefe0673468505" alt="Text book image"
data:image/s3,"s3://crabby-images/9ae58/9ae58d45ce2e430fbdbd90576f52102eefa7841e" alt="Text book image"
data:image/s3,"s3://crabby-images/f7b2e/f7b2e13a7986b0da326090f527c815066b5aa9ba" alt="Text book image"