
Concept explainers
Association Suppose you were to collect data for each pair of variables. You want to make a
- a) Apples: weight in grams, weight in ounces
- b) Apples: circumference (inches), weight (ounces)
- c) College freshmen: shoe size, grade point average
- d) Gasoline: number of miles you drove since filling up, gallons remaining in your tank
a.

Find the explanatory variable and response variable to plot a scatterplot.
Find the direction, form and strength of the scatterplot.
Answer to Problem 1E
Either weight in grams or weight in ounces could be the explanatory or response variable.
The association between the variables is straight, positive and strong.
Explanation of Solution
Given info:
The variables of the apples are given one is weight in grams and the other is weight in ounces.
Justification:
Associated variables:
Two variables are associated or related if the value of one variable gives you information about the value of the other variable.
The two variables weight in grams and weight in ounces are associated variables.
Response variable:
The variable to be measured or observed in regression analysis is called as response variable. In other words it can also be defined as, the variable that is changed due to the impact of the explanatory variable is defined as response variable.
Therefore, the dependent variables which is measured by the independent variables is called the response variable.
Here, given two variables are weight in grams of apple and weight in ounces of apple.
That is, each apple’s weight is measured in two different scales.
Therefore, there will be chances for weight in grams to depend on weight in ounces and vice versa.
Thus, either weight in grams or weight in ounces could be the explanatory or response variable.
Explanatory variable:
The variable used to predict or explain the response variable is called as predictor variable or explanatory variable. In other words it can also be defined as, the variable that explains the changes in the response variable is defined as explanatory variable.
Therefore, the independent variables to predict the response variable is called the predictor variable.
Here, given two variables are weight in grams of apple and weight in ounces of apple.
That is, each apple’s weight is measured in two different scales.
Therefore, there will be chances for weight in grams to depend on weight in ounces and vice versa.
Thus, either weight in grams or weight in ounces could be the explanatory or response variable.
Form of the association between variable:
The form of the association describes whether the data points follow a linear pattern or some other complicated curves. For data if it appears that a line would do a reasonable job of summarizing the overall pattern in the data. Then, the association between two variables is linear.
Here, weight in ounces increases or decreases with the increase or decrease in the weight in grams.
The pattern of the relationship between weight in ounces and weight in grams represents a straight line.
Hence, the association between the weight in ounces and weight in grams is linear.
Direction of association:
If the increase in the values of one variable increases the values of another variable, then the direction is positive. If the increase in the values of one variable decreases the values of another variable, then the direction is negative.
Here, weight in ounces increases or decreases with the increase or decrease in the weight in grams.
Hence, the direction of the association is positive.
Strength of the association:
The association is said to be strong if all the points are close to the straight line. It is said to be weak if all points are far away from the straight line and it is said to be moderate if the data points are moderately close to straight line.
Here, the variables will have perfect correlation between them.
Hence, the association between the variables is strong.
b.

Find the explanatory variable and response variable to plot a scatterplot.
Find the direction, form and strength of the scatterplot.
Answer to Problem 1E
Circumference of apple is explanatory variable and weight is the response variable.
The association between the variables is straight, positive and strong.
Explanation of Solution
Given info:
The variables of the apples are given one is circumference in inches and the other is weight in ounces.
Justification:
Associated variables:
Two variables are associated or related if the value of one variable gives you information about the value of the other variable.
The two variables circumference in inches and weight in ounces are associated variables.
Response variable:
The variable to be measured or observed in regression analysis is called as response variable. In other words it can also be defined as, the variable that is changed due to the impact of the explanatory variable is defined as response variable.
Therefore, the dependent variables which is measured by the independent variables is called the response variable.
Here, given two variables are circumference in inches of apple and weight in ounces of apple.
Three dimensional volume is nothing but the weight and one dimensional circumference explains the three dimensional volume.
Therefore, weight of the apple is predicted with the circumference of the apple.
That is, weight of the apple is depend on the circumference of the apple.
Thus, weight in ounces is dependent or response variable.
Explanatory variable:
The variable used to predict or explain the response variable is called as predictor variable or explanatory variable. In other words it can also be defined as, the variable that explains the changes in the response variable is defined as explanatory variable.
Therefore, the independent variables to predict the response variable is called the predictor variable.
Here, given two variables are circumference in inches of apple and weight in ounces of apple.
Weight of the apple is predicted with the circumference of the apple.
Thus, circumference in inches is independent or explanatory variable.
Form of the association between variable:
The form of the association describes whether the data points follow a linear pattern or some other complicated curves. For data if it appears that a line would do a reasonable job of summarizing the overall pattern in the data. Then, the association between two variables is linear.
Here, weight in ounces increases or decreases with the increase or decrease in the circumference in inches of apple.
The pattern of the relationship between weight in ounces and circumference in inches of apple represents a straight line for same size apples.
Hence, the association between the weight in ounces and circumference in inches of apple is linear for same size apples.
The association curve will be apparent if the sample contains very large and very small apples.
Direction of association:
If the increase in the values of one variable increases the values of another variable, then the direction is positive. If the increase in the values of one variable decreases the values of another variable, then the direction is negative.
Here, weight in ounces increases or decreases with the increase or decrease in the circumference in inches of apple.
Hence, the direction of the association is positive.
Strength of the association:
The association is said to be strong if all the points are close to the straight line. It is said to be weak if all points are far away from the straight line and it is said to be moderate if the data points are moderately close to straight line.
Here, the variables will have perfect correlation between them.
Hence, the association between the variables is strong.
c.

Find the explanatory variable and response variable to plot a scatterplot.
Find the direction, form and strength of the scatterplot.
Answer to Problem 1E
The variables shoe size and grade point average are not associated with each other.
Explanation of Solution
Given info:
The variables of the college freshmen are given one is shoe size and the other is grade point average.
Justification:
Associated variables:
Two variables are associated or related if the value of one variable gives you information about the value of the other variable.
There is no relationship between the variables shoe size and grade point average.
Therefore, there is no association between the variables.
Hence, the discussion will not go further.
d.

Find the explanatory variable and response variable to plot a scatterplot.
Find the direction, form and strength of the scatterplot.
Answer to Problem 1E
Circumference of apple is explanatory variable and weight is the response variable.
The association between the variables is straight, negative and strong.
Explanation of Solution
Given info:
The variables of the gasoline are given one is number of miles drove since filling up and the other is gallons remaining in the tank.
Justification:
Associated variables:
Two variables are associated or related if the value of one variable gives you information about the value of the other variable.
The two variables number of miles drove since filling up and gallons remaining in the tank are associated variables.
Response variable:
The variable to be measured or observed in regression analysis is called as response variable. In other words it can also be defined as, the variable that is changed due to the impact of the explanatory variable is defined as response variable.
Therefore, the dependent variables which is measured by the independent variables is called the response variable.
Here, given two variables are number of miles drove since filling up and gallons remaining in the tank.
The fuel that is remained in the tank is dependent on the fuel that is used for driving.
Therefore, gallons remaining in the tank is predicted with the number of miles drove since filling up.
That is, gallons remaining in the tank is depend on the number of miles drove since filling up.
Thus, gallons remaining in the tank is dependent or response variable.
Explanatory variable:
The variable used to predict or explain the response variable is called as predictor variable or explanatory variable. In other words it can also be defined as, the variable that explains the changes in the response variable is defined as explanatory variable.
Therefore, the independent variables to predict the response variable is called the predictor variable.
Here, given two variables are number of miles drove since filling up and gallons remaining in the tank.
Gallons remaining in the tank is predicted with the number of miles drove since filling up.
Thus, the number of miles drove since filling up is independent or explanatory variable.
Form of the association between variable:
The form of the association describes whether the data points follow a linear pattern or some other complicated curves. For data if it appears that a line would do a reasonable job of summarizing the overall pattern in the data. Then, the association between two variables is linear.
Here, gallons remaining in the tank decreases with the increase in the number of miles drove since filling up.
The pattern of the relationship between gallons remaining in the tank and the number of miles drove since filling up represents a straight line.
Hence, the association between the gallons remaining in the tank and the number of miles drove since filling up is linear.
Direction of association:
If the increase in the values of one variable increases the values of another variable, then the direction is positive. If the increase in the values of one variable decreases the values of another variable, then the direction is negative.
Here, gallons remaining in the tank decreases with the increase in the number of miles drove since filling up and gallons remaining in the tank increases with the decrease in the number of miles drove since filling up.
Hence, the direction of the association is negative.
Strength of the association:
The association is said to be strong if all the points are close to the straight line. It is said to be weak if all points are far away from the straight line and it is said to be moderate if the data points are moderately close to straight line.
Here, the variables will have moderate correlation between them.
Hence, the association between the variables is moderate.
Want to see more full solutions like this?
Chapter 6 Solutions
Intro Stats, Books a la carte Plus NEW MyLab Statistics with Pearson eText -- Access Card Package (5th Edition)
Additional Math Textbook Solutions
A First Course in Probability (10th Edition)
Pathways To Math Literacy (looseleaf)
Finite Mathematics for Business, Economics, Life Sciences and Social Sciences
Elementary Statistics: Picturing the World (7th Edition)
College Algebra (Collegiate Math)
Elementary Statistics ( 3rd International Edition ) Isbn:9781260092561
- Exercise 6-6 (Algo) (LO6-3) The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.arrow_forward1. Find the mean of the x-values (x-bar) and the mean of the y-values (y-bar) and write/label each here: 2. Label the second row in the table using proper notation; then, complete the table. In the fifth and sixth columns, show the 'products' of what you're multiplying, as well as the answers. X y x minus x-bar y minus y-bar (x minus x-bar)(y minus y-bar) (x minus x-bar)^2 xy 16 20 34 4-2 5 2 3. Write the sums that represents Sxx and Sxy in the table, at the bottom of their respective columns. 4. Find the slope of the Regression line: bi = (simplify your answer) 5. Find the y-intercept of the Regression line, and then write the equation of the Regression line. Show your work. Then, BOX your final answer. Express your line as "y-hat equals...arrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Generate the log of birthweight and family income of children. Name these new variables Ibwght & Ifaminc. Include the output of this code. ii. Apply the command sum with the detail option to the variable faminc. Note: you should find the 25th percentile value, the 50th percentile and the 75th percentile value of faminc from the output - you will need it to answer the next question Include the output of this code. iii. iv. Use the output from part ii of this question to Generate a variable called "high_faminc" that takes a value 1 if faminc is less than or equal to the 25th percentile, it takes the value 2 if faminc is greater than 25th percentile but less than or equal to the 50th percentile, it takes the value 3 if faminc is greater than 50th percentile but less than or equal to the 75th percentile, it takes the value 4 if faminc is greater than the 75th percentile. Include the outcome of this code…arrow_forward
- solve this on paperarrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Apply the command egen to create a variable called "wyd" which is the rowtotal function on variables bwght & faminc. ii. Apply the list command for the first 10 observations to show that the code in part i worked. Include the outcome of this code iii. Apply the egen command to create a new variable called "bwghtsum" using the sum function on variable bwght by the variable high_faminc (Note: need to apply the bysort' statement) iv. Apply the "by high_faminc" statement to find the V. descriptive statistics of bwght and bwghtsum Include the output of this code. Why is there a difference between the standard deviations of bwght and bwghtsum from part iv of this question?arrow_forwardAccording to a health information website, the distribution of adults’ diastolic blood pressure (in millimeters of mercury, mmHg) can be modeled by a normal distribution with mean 70 mmHg and standard deviation 20 mmHg. b. Above what diastolic pressure would classify someone in the highest 1% of blood pressures? Show all calculations used.arrow_forward
- Write STATA codes which will generate the outcomes in the questions & submit the output for each question only when indicated below i. ii. iii. iv. V. Write a code which will allow STATA to go to your favorite folder to access your files. Load the birthweight1.dta dataset from your favorite folder and save it under a different filename to protect data integrity. Call the new dataset babywt.dta (make sure to use the replace option). Verify that it contains 2,998 observations and 8 variables. Include the output of this code. Are there missing observations for variable(s) for the variables called bwght, faminc, cigs? How would you know? (You may use more than one code to show your answer(s)) Include the output of your code (s). Write the definitions of these variables: bwght, faminc, male, white, motheduc,cigs; which of these variables are categorical? [Hint: use the labels of the variables & the browse command] Who is this dataset about? Who can use this dataset to answer what kind of…arrow_forwardApply STATA commands & submit the output for each question only when indicated below İ. ii. iii. iv. V. Apply the command summarize on variables bwght and faminc. What is the average birthweight of babies and family income of the respondents? Include the output of this code. Apply the tab command on the variable called male. How many of the babies and what share of babies are male? Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are white. Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are male but not white. Include the output of this code. Using your answers to previous subparts of this question: What is the difference between the average birthweight of a baby who is male and a baby who is male but not white? What can you say anything about the difference in family income of the babies that are male and male…arrow_forwardA public health researcher is studying the impacts of nudge marketing techniques on shoppers vegetablesarrow_forward
- The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.arrow_forwardA pollster randomly selected four of 10 available people. Required: How many different groups of 4 are possible? What is the probability that a person is a member of a group? Note: Round your answer to 3 decimal places.arrow_forwardWind Mountain is an archaeological study area located in southwestern New Mexico. Potsherds are broken pieces of prehistoric Native American clay vessels. One type of painted ceramic vessel is called Mimbres classic black-on-white. At three different sites the number of such sherds was counted in local dwelling excavations. Test given. Site I Site II Site III 63 19 60 43 34 21 23 49 51 48 11 15 16 46 26 20 31 Find .arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALAlgebra: Structure And Method, Book 1AlgebraISBN:9780395977224Author:Richard G. Brown, Mary P. Dolciani, Robert H. Sorgenfrey, William L. ColePublisher:McDougal Littell
- Big Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtFunctions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage LearningAlgebra and Trigonometry (MindTap Course List)AlgebraISBN:9781305071742Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage Learning





