Concept explainers
In addition to the key words, you should also be able to define each of the following terms:
Linear relationship
Pearson
Monotonic relationship
Spearman correlation
Statistical significance of a
Regression
Third-variable problem
Directionality problem
Multiple regression
To define:
Each of the following terms: Scatter plot, linear relationships, Pearson correlation, monotonic relationship, Spearman correlation, Statistical significance of a correlation, regression, third variable problem, directionality problem, multiple regressions.
Explanation of Solution
Explanations:
Scatter Plot: It is a 2D graph between 2 variables X and Y obtained by plotting X on horizontal and Y on vertical axes. The scatter plot is mainly done to study the extent of correlation between 2 variables. If a large correlation exists then the points scatter in a line and if there's no such correlation then they are scattered randomly.
Linear Relationship: In a linear relationship the relationship between 2 variables can be represented with a line. The linear relationship can be positive or negative depending on the fact that if X increase then Y increases too and it is positive. It is negative when X increases but Y decreases. Linear relationships can be determined using scatter plots between 2 variables. There ca be no relationship too in that case the points scattered randomly. Graphical representation of positive linear relation:
Pearson correlation:
Pearson product correlation coefficient is a measure of linear association between 2 variables X and Y. It has a value between -1 and +1 and is denoted by r.
r consists of a ratio comparing the covariance( X and Y) (numerator) with the variance of X and Y separately (in the denominator).
The formula:
Monotonic relationship: It is the degree to which the relationship is one directional. So as one value increases the other also increases and as one decreases other also decreases. Spearman correlation measures this value.
Spearman Correlation: This is distribution free alternative to Pearson r value. It measures strength and direction of monotonic relationship and used mostly for ordinal observations.
The formula is:
In this case ranks are calculated for each data X and Y and their difference is taken represented as d in the formula. It can also take values in (-1,+1)range. A high positive value like 0.9 indicates a strong positive correlation and vice versa. So here as X increases so does Y (Just like r).
Statistical Significance of a correlation: Correlation is tested using t statistic.
The formula is provided as
to test whether the linear relationship is strong enough to model the relationship in the population. The hypothesis tests decide if the population correlation
Regression: It is a set of statistical processes for establishing or estimating the relationship among variables. Regression analysis helps to understand how changes in independent variable results in changes in dependent variable. It is widely used for prediction and forecasting. It helps in understanding which independent variables are related to the dependent variable and what is the form of the relationship. In other words it can be used to establish causal relationship.
Third variable problem: In context of experimental design third variable can be confounded variable resulting in relationship between 2 variables. Example can be cities with higher churches have higher crime rate but more churches do not lead to more crime so a 3rd variable population lead to more churches and more crime.
Directionality problem: It's a problem with 2 variables when the cause and effect is not known. It's known that there exists a relationship between X and Y however whether X is due to Y or vice versa is not known. For that apart from correlational study experimental study are required to be conducted. So an example can be there's a strong correlation between amount of violence seen on Tv and amount of aggressive behavior by children but its not known whether amount of violence seen on TV is due to aggressive behavior or vice versa.
Multiple regressions: The general purpose of the multiple regressions is to establish a relationship between several predictor variables and a dependent variable. Notation ally if Y is a dependent variable and Xs are several predictors then f is a function or line of least square which is estimated in multiple regressions.
It is widely used in social and natural sciences. Educators might be interested in predictors for success in high school or psychologist might be interested in personality best predicting social adjustments.
For example Price of the house sold can be dependent variable and size of house, number of bedrooms, locality, average income in respective neighborhoods, and appeal of the house can be predictors. The other example can be Salary being dependent and amount of responsibility, number of people to supervise etc. are independent.
Want to see more full solutions like this?
Chapter 12 Solutions
Research Methods for the Behavioral Sciences (MindTap Course List)
- 1.2.17. (!) Let G,, be the graph whose vertices are the permutations of (1,..., n}, with two permutations a₁, ..., a,, and b₁, ..., b, adjacent if they differ by interchanging a pair of adjacent entries (G3 shown below). Prove that G,, is connected. 132 123 213 312 321 231arrow_forwardYou are planning an experiment to determine the effect of the brand of gasoline and the weight of a car on gas mileage measured in miles per gallon. You will use a single test car, adding weights so that its total weight is 3000, 3500, or 4000 pounds. The car will drive on a test track at each weight using each of Amoco, Marathon, and Speedway gasoline. Which is the best way to organize the study? Start with 3000 pounds and Amoco and run the car on the test track. Then do 3500 and 4000 pounds. Change to Marathon and go through the three weights in order. Then change to Speedway and do the three weights in order once more. Start with 3000 pounds and Amoco and run the car on the test track. Then change to Marathon and then to Speedway without changing the weight. Then add weights to get 3500 pounds and go through the three gasolines in the same order.Then change to 4000 pounds and do the three gasolines in order again. Choose a gasoline at random, and run the car with this gasoline at…arrow_forwardAP1.2 A child is 40 inches tall, which places her at the 90th percentile of all children of similar age. The heights for children of this age form an approximately Normal distribution with a mean of 38 inches. Based on this information, what is the standard deviation of the heights of all children of this age? 0.20 inches (c) 0.65 inches (e) 1.56 inches 0.31 inches (d) 1.21 inchesarrow_forward
- AP1.1 You look at real estate ads for houses in Sarasota, Florida. Many houses range from $200,000 to $400,000 in price. The few houses on the water, however, have prices up to $15 million. Which of the following statements best describes the distribution of home prices in Sarasota? The distribution is most likely skewed to the left, and the mean is greater than the median. The distribution is most likely skewed to the left, and the mean is less than the median. The distribution is roughly symmetric with a few high outliers, and the mean is approximately equal to the median. The distribution is most likely skewed to the right, and the mean is greater than the median. The distribution is most likely skewed to the right, and the mean is less than the median.arrow_forwardDuring busy political seasons, many opinion polls are conducted. In apresidential race, how do you think the participants in polls are generally selected?Discuss any issues regarding simple random, stratified, systematic, cluster, andconvenience sampling in these polls. What about other types of polls, besides political?arrow_forwardPlease could you explain why 0.5 was added to each upper limpit of the intervals.Thanksarrow_forward
- 28. (a) Under what conditions do we say that two random variables X and Y are independent? (b) Demonstrate that if X and Y are independent, then it follows that E(XY) = E(X)E(Y); (e) Show by a counter example that the converse of (ii) is not necessarily true.arrow_forward1. Let X and Y be random variables and suppose that A = F. Prove that Z XI(A)+YI(A) is a random variable.arrow_forward30. (a) What is meant by the term "product measur"? ANDarrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtCollege AlgebraAlgebraISBN:9781305115545Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage Learning
- Algebra and Trigonometry (MindTap Course List)AlgebraISBN:9781305071742Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage LearningHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALFunctions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage Learning