
Concept explainers
a.
Find the number of variables included in the
Identify whether each of the variables in Figure 2.84(a) is categorical or quantitative.
Estimate the
a.

Answer to Problem 226E
The number of variables included in the scatterplot in Figure 2.84(a) is 2.
Both of the variables are quantitative.
The range for Variable1 is 16.
The range for Variable2 is 90.
Explanation of Solution
From the given scatterplot in Figure 2.84(a), it is clear that there are two variables included, Variable1 in x axis and Variable2 in y axis.
The scales of Variable1 and Variable2 are numerical values. Hence, both of these variables are quantitative.
The minimum and maximum values of the data points observed from the scatterplot for Variable1 are approximately 14 and 29, respectively.
The minimum and maximum values of the data points observed from the scatterplot for Variable2 are approximately 70 and 160, respectively.
The ranges for Variable1 and Variable2 are computed as follows:
Therefore, the range for Variable1 is 16 and the range for Variable2 is 90.
b.
Explain whether the association between the variables appears to be positive or negative in Figure 2.84(a).
b.

Answer to Problem 226E
The association between the variables appears to be positive.
Explanation of Solution
In Figure 2.84(a), as the Variable1 increases, Variable2 also increases. This is an indication of positive association.
Therefore, the association between the variables appears to be positive.
c.
Identify the response variable.
Explain whether the line shows a positive or negative association.
c.

Answer to Problem 226E
The response variable is Variable2.
The regression line shows a positive association.
Explanation of Solution
The variable in the vertical axis represents a response variable and the variable in the horizontal axis represents an explanatory variable.
In Figure 2.84(b), Variable2 is in the vertical axis, whereas Variable1 is in the horizontal axis. Therefore, the response variable is Variable2.
It is also clear from the regression line that the slope of the line is increasing. This indicates that there is a positive association between the variables.
d.
Identify whether the third variable included is categorical or quantitative.
Find the number of categories if it is a categorical variable.
Find the range if it is a quantitative variable.
d.

Answer to Problem 226E
The third variable included is categorical.
The number of categories is 4.
Explanation of Solution
From Figure 2.85(a), the data points are indicated by different symbols, which are labeled as A, B, C, and D. They are non-numerical values. Thus, it is clear that the Variable3 is a categorical variable.
There are four different labels. Thus, the number of categories is 4.
e.
Identify whether the association between Variable1 and Variable2 appears to be positive or negative by considering the case in Group A.
Identify whether the association between Variable1 and Variable2 appears to be positive or negative by considering the case in Group B.
Identify whether the association between Variable1 and Variable2 appears to be positive or negative by considering the case in Group C.
Identify whether the association between Variable1 and Variable2 appears to be positive or negative by considering the case in Group D.
e.

Answer to Problem 226E
The association between Variable1 and Variable2 by considering the case in Group A appears to be negative.
The association between Variable1 and Variable2 by considering the case in Group B appears to be negative.
The association between Variable1 and Variable2 by considering the case in Group C appears to be negative.
The association between Variable1 and Variable2 by considering the case in Group D appears to be negative.
Explanation of Solution
From Figure 2.85 (a), it is clear that the data points of the all categories (A, B, C, and D) are in the decreasing order. That is, as Variable1 increases, Variable2 decreases. Thus, the association between Variable1 and Variable2 is negative by considering Group A, Group B, Group C, and Group D.
f.
Explain whether the regression line for Group A shows a positive or negative association.
Explain whether the regression line for Group B shows a positive or negative association.
Explain whether the regression line for Group C shows a positive or negative association.
Explain whether the regression line for Group D shows a positive or negative association.
f.

Answer to Problem 226E
The regression line for Group A shows a negative association.
The regression line for Group B shows a negative association.
The regression line for Group C shows a negative association.
The regression line for Group D shows a negative association.
Explanation of Solution
In Figure 2.85(b), it is clear from the regression line that the slope of the line is increasing for all the four categories. This indicates the negative association between the variables.
That is, the regression line for Groups A, B, C, and D shows a negative association.
g.
Explain about the difference in the direction of association between Figure 2.84 and Figure 2.85.
g.

Explanation of Solution
In Figure 2.84, the association between variables is positive, while the association between variables is shown as negative in Figure 2.85.
By including additional information contained in Variable3, the association switches from positive to negative.
Want to see more full solutions like this?
Chapter 2 Solutions
Statistics, Binder Ready Version: Unlocking the Power of Data
- The following relates to Problems 4 and 5. Christchurch, New Zealand experienced a major earthquake on February 22, 2011. It destroyed 100,000 homes. Data were collected on a sample of 300 damaged homes. These data are saved in the file called CIEG315 Homework 4 data.xlsx, which is available on Canvas under Files. A subset of the data is shown in the accompanying table. Two of the variables are qualitative in nature: Wall construction and roof construction. Two of the variables are quantitative: (1) Peak ground acceleration (PGA), a measure of the intensity of ground shaking that the home experienced in the earthquake (in units of acceleration of gravity, g); (2) Damage, which indicates the amount of damage experienced in the earthquake in New Zealand dollars; and (3) Building value, the pre-earthquake value of the home in New Zealand dollars. PGA (g) Damage (NZ$) Building Value (NZ$) Wall Construction Roof Construction Property ID 1 0.645 2 0.101 141,416 2,826 253,000 B 305,000 B T 3…arrow_forwardRose Par posted Apr 5, 2025 9:01 PM Subscribe To: Store Owner From: Rose Par, Manager Subject: Decision About Selling Custom Flower Bouquets Date: April 5, 2025 Our shop, which prides itself on selling handmade gifts and cultural items, has recently received inquiries from customers about the availability of fresh flower bouquets for special occasions. This has prompted me to consider whether we should introduce custom flower bouquets in our shop. We need to decide whether to start offering this new product. There are three options: provide a complete selection of custom bouquets for events like birthdays and anniversaries, start small with just a few ready-made flower arrangements, or do not add flowers. There are also three possible outcomes. First, we might see high demand, and the bouquets could sell quickly. Second, we might have medium demand, with a few sold each week. Third, there might be low demand, and the flowers may not sell well, possibly going to waste. These outcomes…arrow_forwardConsider the state space model X₁ = §Xt−1 + Wt, Yt = AX+Vt, where Xt Є R4 and Y E R². Suppose we know the covariance matrices for Wt and Vt. How many unknown parameters are there in the model?arrow_forward
- Business Discussarrow_forwardYou want to obtain a sample to estimate the proportion of a population that possess a particular genetic marker. Based on previous evidence, you believe approximately p∗=11% of the population have the genetic marker. You would like to be 90% confident that your estimate is within 0.5% of the true population proportion. How large of a sample size is required?n = (Wrong: 10,603) Do not round mid-calculation. However, you may use a critical value accurate to three decimal places.arrow_forward2. [20] Let {X1,..., Xn} be a random sample from Ber(p), where p = (0, 1). Consider two estimators of the parameter p: 1 p=X_and_p= n+2 (x+1). For each of p and p, find the bias and MSE.arrow_forward
- 1. [20] The joint PDF of RVs X and Y is given by xe-(z+y), r>0, y > 0, fx,y(x, y) = 0, otherwise. (a) Find P(0X≤1, 1arrow_forward4. [20] Let {X1,..., X} be a random sample from a continuous distribution with PDF f(x; 0) = { Axe 5 0, x > 0, otherwise. where > 0 is an unknown parameter. Let {x1,...,xn} be an observed sample. (a) Find the value of c in the PDF. (b) Find the likelihood function of 0. (c) Find the MLE, Ô, of 0. (d) Find the bias and MSE of 0.arrow_forward3. [20] Let {X1,..., Xn} be a random sample from a binomial distribution Bin(30, p), where p (0, 1) is unknown. Let {x1,...,xn} be an observed sample. (a) Find the likelihood function of p. (b) Find the MLE, p, of p. (c) Find the bias and MSE of p.arrow_forwardGiven the sample space: ΩΞ = {a,b,c,d,e,f} and events: {a,b,e,f} A = {a, b, c, d}, B = {c, d, e, f}, and C = {a, b, e, f} For parts a-c: determine the outcomes in each of the provided sets. Use proper set notation. a. (ACB) C (AN (BUC) C) U (AN (BUC)) AC UBC UCC b. C. d. If the outcomes in 2 are equally likely, calculate P(AN BNC).arrow_forwardSuppose a sample of O-rings was obtained and the wall thickness (in inches) of each was recorded. Use a normal probability plot to assess whether the sample data could have come from a population that is normally distributed. Click here to view the table of critical values for normal probability plots. Click here to view page 1 of the standard normal distribution table. Click here to view page 2 of the standard normal distribution table. 0.191 0.186 0.201 0.2005 0.203 0.210 0.234 0.248 0.260 0.273 0.281 0.290 0.305 0.310 0.308 0.311 Using the correlation coefficient of the normal probability plot, is it reasonable to conclude that the population is normally distributed? Select the correct choice below and fill in the answer boxes within your choice. (Round to three decimal places as needed.) ○ A. Yes. The correlation between the expected z-scores and the observed data, , exceeds the critical value, . Therefore, it is reasonable to conclude that the data come from a normal population. ○…arrow_forwardding question ypothesis at a=0.01 and at a = 37. Consider the following hypotheses: 20 Ho: μ=12 HA: μ12 Find the p-value for this hypothesis test based on the following sample information. a. x=11; s= 3.2; n = 36 b. x = 13; s=3.2; n = 36 C. c. d. x = 11; s= 2.8; n=36 x = 11; s= 2.8; n = 49arrow_forwardarrow_back_iosSEE MORE QUESTIONSarrow_forward_ios
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman





