
Concept explainers
Iris setosa is a beautiful wildflower that is found in such diverse places as Alaska, the Gulf of St. Lawrence, much of North America, and even in English meadows and parks. R. A. Fisher, with his colleague Dr. Edgar Anderson, studied these flowers extensively. Dr. Anderson described how he collected information on irises:
I have studied such irises as I could get to see, in as great detail as possible. measuring iris standard after iris standard and iris fall after iris fall, sitting squat-legged with record book and ruler in mountain meadows, in cypress swamps, on lake beaches, and in English parks. [E. Anderson. "The Irises of the Gaspé Peninsula." Bulletin. American IrisSociety, Vol. 59 pp. 2-5, 1935.]
The data in Table 7-10 were collected by Dr. Anderson and were published by his friend and colleague R. A. Fisher in a paper titled "The Use of Multiple Measurements in Taxonomic Problems" (Annals of Eugenics. part II. pp. 179-188, 1936). To find these data, visit the Carnegie Mellon University Data and Story Library (DASI.) web site. From the DASI. site, look under Biology and select Fisher's Irises Story.
Let x be a random variable representing petal length. Using a TI-84Plus/TI-83Plus/TI-n spire calculator, it was found that the sample mean is
(a) Examine the histogram for petal lengths. Would you say that the distribution is approximately mound-shaped and symmetric? Our sample has only 50 irises; if many thousands of irises had been used, do you think the distribution would look even more like a normal curve? Let x be the petal length of Iris setosa. Research has shown that x has an approximately
(b) Use the
(c) Compute the
(d) Suppose that a random sample of 30 irises is obtained. Compute the probability that the average petal length for this sample is between 1.3 and 1.6 cm. Compute the probability that the average petal length is greater than 1.6 cm.
(e) Compare your answers to parts (c) and (d). Do you notice any differences? Why would these differences occur?
TABLE 7-10 | Petal Length in Centimeters for Iris serosa | |||
1.4 | 14 | 1.3 | 1.5 | 1.4 |
1.7 | 1.4 | 1.5 | 14 | 1.5 |
1.5 | 1.6 | 14 | 1.1 | 1.2 |
1.5 | 1.3 | 1.4 | 1.7 | 1.5 |
1.7 | 1.5 | 1 | 1.7 | 1.9 |
1.6 | 16 | 1.5 | 1.4 | 16 |
1.5 | 1.5 | 1.4 | 1.5 | |
1.2 | 1.3 | 1.4 | 1.3 | 1.5 |
1.3 | 1.3 | 1.3 | 1.6 | 1.9 |
1.4 | 1.6 | 1.4 | 1.5 | 14 |
FIGURE 7-36
Petal Length (cm) for Iris setosa (TI-84Plus/TI-83Plus/TI-n spire)
(a)

To explain: Whether the distribution is approximately mound-shaped and symmetrical.
Answer to Problem DHGP
Solution: Yes, the distribution is approximately mound-shaped and symmetrical.
Explanation of Solution
Calculation:
From the histogram for petal lengths, the distribution is approximately bell-shaped or mound-shaped and symmetrical because approximately the left half of the graph being the mirror image of the right half of the graph.
Our sample has only 50 irises; if many thousands of irises had been used, the distribution would look more similar to normal curve because the sample is very largeand the distribution of the sample will be approximately normally distributed.
(b)

To find: The 68%, 95% and 99% interval and compare the computed percentages with those given by empirical rule..
Answer to Problem DHGP
Solution: The 68%, 95% and 99% interval are (1.3, 1.7), (1.1, 1.9), (0.9, 2.1) respectively.
Explanation of Solution
Let x be the petal length of Iris Setosa and x has an approximately normal distribution, with mean
We know that, 68% of the observations will fall within one standard deviation of mean.
The 68% interval is,
95% of the observations will fall within two standard deviation of mean.
The 95% interval is,
99.7% of the observations will fall within two standard deviation of mean.
The 99.7% interval is,
There are 33 data values fall within the interval 1.3 and 1.7, so the percentage of data within the interval 1.3 and 1.7 is
There are 46 data values fall within the interval 1.1 and 1.9, so the percentage of data within the interval 1.3 and 1.7 is
All data values fall within the interval 0.9 and 2.1, so the percentage of data within the interval 1.3 and 1.7 is
(c)

To find: The probability that a petal length is between 1.3 and 1.6 cm and the probability that a petal length is greater than 1.6 cm.
Answer to Problem DHGP
Solution: The probability that a petal length is between 1.3 and 1.6 cm is 0.5328. The probability that a petal length is greater than 1.6 cm is 0.3085.
Explanation of Solution
Let x be the petal length of Iris Setosa and x has an approximately normal distribution, with mean
We convert the interval
Using Table 3 from the Appendix to find the
Hence, the probability that a petal length is between 1.3 and 1.6 cm is 0.5328.
We convert the interval
Using Table 3 from the Appendix
Hence, the probability that a petal length is greater than 1.6 cm is 0.3085.
(d)

To find: The probability that average petal length is between 1.3 and 1.6 cm and the probability that average petal length is greater than 1.6 cm.
Answer to Problem DHGP
Solution: The probability that average petal length is between 1.3 and 1.6 cm is 0.9972. The probability that averagepetal length is greater than 1.6 cm is 0.0027.
Explanation of Solution
Let x has an approximately normal distribution, with mean
We convert the interval
Using Table 3 from the Appendix
Hence, the probability that average petal length is between 1.3 and 1.6 cm is 0.9972.
We convert the interval
Using Table 3 from the Appendix
Hence, the probability that a petal length is greater than 1.6 cm is 0.0027.
(e)

To explain: The comparison of part (c) and part (d).
Answer to Problem DHGP
Solution:
The standard deviation of the sample mean is much smaller than the population standard deviation.
Explanation of Solution
In part (c), x has a distribution that is approximately normal with
In part (b),
The central limit theorem tells us that the standard deviation of the sample mean is much smaller than the population standard deviation.
Want to see more full solutions like this?
Chapter 7 Solutions
Understanding Basic Statistics
- Exercise 6-6 (Algo) (LO6-3) The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.arrow_forward1. Find the mean of the x-values (x-bar) and the mean of the y-values (y-bar) and write/label each here: 2. Label the second row in the table using proper notation; then, complete the table. In the fifth and sixth columns, show the 'products' of what you're multiplying, as well as the answers. X y x minus x-bar y minus y-bar (x minus x-bar)(y minus y-bar) (x minus x-bar)^2 xy 16 20 34 4-2 5 2 3. Write the sums that represents Sxx and Sxy in the table, at the bottom of their respective columns. 4. Find the slope of the Regression line: bi = (simplify your answer) 5. Find the y-intercept of the Regression line, and then write the equation of the Regression line. Show your work. Then, BOX your final answer. Express your line as "y-hat equals...arrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Generate the log of birthweight and family income of children. Name these new variables Ibwght & Ifaminc. Include the output of this code. ii. Apply the command sum with the detail option to the variable faminc. Note: you should find the 25th percentile value, the 50th percentile and the 75th percentile value of faminc from the output - you will need it to answer the next question Include the output of this code. iii. iv. Use the output from part ii of this question to Generate a variable called "high_faminc" that takes a value 1 if faminc is less than or equal to the 25th percentile, it takes the value 2 if faminc is greater than 25th percentile but less than or equal to the 50th percentile, it takes the value 3 if faminc is greater than 50th percentile but less than or equal to the 75th percentile, it takes the value 4 if faminc is greater than the 75th percentile. Include the outcome of this code…arrow_forward
- solve this on paperarrow_forwardApply STATA commands & submit the output for each question only when indicated below i. Apply the command egen to create a variable called "wyd" which is the rowtotal function on variables bwght & faminc. ii. Apply the list command for the first 10 observations to show that the code in part i worked. Include the outcome of this code iii. Apply the egen command to create a new variable called "bwghtsum" using the sum function on variable bwght by the variable high_faminc (Note: need to apply the bysort' statement) iv. Apply the "by high_faminc" statement to find the V. descriptive statistics of bwght and bwghtsum Include the output of this code. Why is there a difference between the standard deviations of bwght and bwghtsum from part iv of this question?arrow_forwardAccording to a health information website, the distribution of adults’ diastolic blood pressure (in millimeters of mercury, mmHg) can be modeled by a normal distribution with mean 70 mmHg and standard deviation 20 mmHg. b. Above what diastolic pressure would classify someone in the highest 1% of blood pressures? Show all calculations used.arrow_forward
- Write STATA codes which will generate the outcomes in the questions & submit the output for each question only when indicated below i. ii. iii. iv. V. Write a code which will allow STATA to go to your favorite folder to access your files. Load the birthweight1.dta dataset from your favorite folder and save it under a different filename to protect data integrity. Call the new dataset babywt.dta (make sure to use the replace option). Verify that it contains 2,998 observations and 8 variables. Include the output of this code. Are there missing observations for variable(s) for the variables called bwght, faminc, cigs? How would you know? (You may use more than one code to show your answer(s)) Include the output of your code (s). Write the definitions of these variables: bwght, faminc, male, white, motheduc,cigs; which of these variables are categorical? [Hint: use the labels of the variables & the browse command] Who is this dataset about? Who can use this dataset to answer what kind of…arrow_forwardApply STATA commands & submit the output for each question only when indicated below İ. ii. iii. iv. V. Apply the command summarize on variables bwght and faminc. What is the average birthweight of babies and family income of the respondents? Include the output of this code. Apply the tab command on the variable called male. How many of the babies and what share of babies are male? Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are white. Include the output of this code. Find the summary statistics (i.e. use the sum command) of the variables bwght and faminc if the babies are male but not white. Include the output of this code. Using your answers to previous subparts of this question: What is the difference between the average birthweight of a baby who is male and a baby who is male but not white? What can you say anything about the difference in family income of the babies that are male and male…arrow_forwardA public health researcher is studying the impacts of nudge marketing techniques on shoppers vegetablesarrow_forward
- The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.arrow_forwardA pollster randomly selected four of 10 available people. Required: How many different groups of 4 are possible? What is the probability that a person is a member of a group? Note: Round your answer to 3 decimal places.arrow_forwardWind Mountain is an archaeological study area located in southwestern New Mexico. Potsherds are broken pieces of prehistoric Native American clay vessels. One type of painted ceramic vessel is called Mimbres classic black-on-white. At three different sites the number of such sherds was counted in local dwelling excavations. Test given. Site I Site II Site III 63 19 60 43 34 21 23 49 51 48 11 15 16 46 26 20 31 Find .arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALMathematics For Machine TechnologyAdvanced MathISBN:9781337798310Author:Peterson, John.Publisher:Cengage Learning,
- Functions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage Learning



