
Concept explainers
(a)
To find: The Inter
(a)

Answer to Problem 67E
Solution: The Inter
Explanation of Solution
Calculation: The Inter Quartile Range
Step 1: Enter the provided data in a Minitab worksheet.
Step 2: Go to Stat and select basic statistics.
Step 3: Then select Display
Step 4: Next click on Statistics tab and tick mark the option against interquartile range and click on OK twice to get the results.
From the Minitab output the Inter Quartile Range
Interpretation: The Inter Quartile Range
(b)
To find: The outliers using
(b)

Answer to Problem 67E
Solution: The value of outliers in the provided data is 428 of London. Any value above 236.5 or below –35.5 would be considered outliers as they are upper whisker and lower whisker, respectively.
Explanation of Solution
Calculation: To obtain the value of first and third quartile, follow the steps given below in Minitab,
Step 1: Enter the data in a Minitab worksheet.
Step 2: Go to Stat and select basic statistics.
Step 3: Then select Display Descriptive Statistics. Enter the name of the column containing the provided data in the variables textbox.
Step 4: Then click on Statistics tab and tick mark the option against first quartile and third quartile and click on OK twice.
The value of
The formula for upper whisker is,
where
The formula for lower whisker is,
where
Upper whisker of boxplot is found to be 236.5, so any values above 236.5 would be considered outliers and from data provided, value of 428 of London is outlier. There is no value below lower whisker.
Interpretation: Outliers refers to those data points that lie either above upper whisker and below lower whisker in boxplot. There was one outlier found in this data, and it is 428 of London.
(c)
To graph: A boxplot of the provided data and describe the distribution using it.
(c)

Explanation of Solution
Graph: Plot the boxplot in Minitab by performing the following steps,
Step 1: Enter the data into a Minitab worksheet.
Step 2: Go to ‘Graph’ and click on ‘Boxplot’.
Step 3: In the dialogue box that appears select ‘Simple’ and click OK.
Step 4: Next enter the name of the column containing the data in the filed marked as ‘Graph variables’ and click on OK.
The boxplot is obtained as shown below,
Interpretation: The boxplot is generally preferred to describe dataset having unsymmetrical distribution. The boxplot shows First quartile, Median, and Third quartile. The boxplot of the data is shown above and clearly displays an outlier. The boxplot marks the outlier as a part of the whisker.
(d)
To graph: A modified boxplot and describes the distribution using it.
(d)

Explanation of Solution
Graph: Plot the modified boxplot in Minitab by performing the following steps,
Step 1: Enter the data into a Minitab worksheet.
Step 2: Go to ‘Graph’ and click on ‘Boxplot’.
Step 3: In the dialogue box that appears select ‘Simple’ and click OK.
Step 4: Next enter the name of the column containing the data in the filed marked as ‘Graph variables’ and click on OK.
The boxplot is obtained as shown below,
Interpretation: The modified boxplot is used to display data graphically when the distribution of data is unsymmetrical and skewed as it can clearly show outliers. In the modified boxplot it was found there is one data value which is upper outlier. This outlier is London. The modified boxplot does not display the outlier as a part of the whisker but marks the outlier away.
(e)
To graph: A stemplot of the provided data.
(e)

Explanation of Solution
Graph: Follow the steps given below to obtain the stemplot:
Step 1: Enter the data of sales in a Minitab worksheet.
Step 2: Go to Graph and select stem and leaves.
Step 3: Enter the name of the column containing the data in the Graph variables textbox and click OK.
The required stemplot is attached below,
Interpretation: The stemplot of data is generally drawn when size of data is small and all the data values are positive. It shows all the data values on stemplot. In the stemplot shown above there is one outliers which is London whose data values is 428. Also the data does not seem to be symmetrically distributed.
(f)
To find: The comparison of Boxplot, Modified boxplot, and stemplot and mention advantages and disadvantages of each.
(f)

Answer to Problem 67E
Solution: In boxplot, data is displayed based on five-number summary, which included Minimum, Maximum, first quartile, third quartile, and Median and displaying the outlier as part of the whisker. In Modified boxplot also data is displayed based on five-number summary, but it displays the outliers such that they are not connected to the whiskers. In stemplot, data values are arranged in stem consisting of all digits except right most and leaves contain final digit. Advantage of boxplots is that it is suitable for unsymmetrical data while advantage of stemplot is that it shows all numerical value of data on graph itself. Disadvantage of Boxplot is that it is not suitable for unsymmetrical data while disadvantage of stemplot is that it is used only for positive numbers only and if the data size is small.
Explanation of Solution
The comparison of Boxplot, Modified boxplot, and stemplot is shown below:
Boxplot |
Modified Boxplot |
Stemplot |
|
Description |
It displays data based on |
It displays data based on five number summary including Minimum, Maximum, First quartile and Third Quartile and Median. The outliers are not displayed as part of the whiskers. |
In stemplot data values are arranged in stem consisting of all digits except right most digit and leaves contain final digit |
Advantages |
1. It displays five number summary graphically. 2. It is suitable for unsymmetrical data. |
1. It displays five number summary. 2. It is suitable for unsymmetrical data which is skewed. 3. It shows outliers clearly. |
1. It can display both symmetrical and unsymmetrical data graphically. 2. It can indicate outliers also 3. It displays all numerical values of data on stemplot. |
Disadvantages |
1. It is not suitable data set having symmetrical distribution. 2. It does not display outliers on graph. |
1. It is not suitable data set having symmetrical distribution. |
1. It is not suitable if data size is very large. 2. It is not used fornegative numbers. |
Want to see more full solutions like this?
Chapter 1 Solutions
EBK INTRODUCTION TO THE PRACTICE OF STA
- You find out that the dietary scale you use each day is off by a factor of 2 ounces (over — at least that’s what you say!). The margin of error for your scale was plus or minus 0.5 ounces before you found this out. What’s the margin of error now?arrow_forwardSuppose that Sue and Bill each make a confidence interval out of the same data set, but Sue wants a confidence level of 80 percent compared to Bill’s 90 percent. How do their margins of error compare?arrow_forwardSuppose that you conduct a study twice, and the second time you use four times as many people as you did the first time. How does the change affect your margin of error? (Assume the other components remain constant.)arrow_forward
- Out of a sample of 200 babysitters, 70 percent are girls, and 30 percent are guys. What’s the margin of error for the percentage of female babysitters? Assume 95 percent confidence.What’s the margin of error for the percentage of male babysitters? Assume 95 percent confidence.arrow_forwardYou sample 100 fish in Pond A at the fish hatchery and find that they average 5.5 inches with a standard deviation of 1 inch. Your sample of 100 fish from Pond B has the same mean, but the standard deviation is 2 inches. How do the margins of error compare? (Assume the confidence levels are the same.)arrow_forwardA survey of 1,000 dental patients produces 450 people who floss their teeth adequately. What’s the margin of error for this result? Assume 90 percent confidence.arrow_forward
- The annual aggregate claim amount of an insurer follows a compound Poisson distribution with parameter 1,000. Individual claim amounts follow a Gamma distribution with shape parameter a = 750 and rate parameter λ = 0.25. 1. Generate 20,000 simulated aggregate claim values for the insurer, using a random number generator seed of 955.Display the first five simulated claim values in your answer script using the R function head(). 2. Plot the empirical density function of the simulated aggregate claim values from Question 1, setting the x-axis range from 2,600,000 to 3,300,000 and the y-axis range from 0 to 0.0000045. 3. Suggest a suitable distribution, including its parameters, that approximates the simulated aggregate claim values from Question 1. 4. Generate 20,000 values from your suggested distribution in Question 3 using a random number generator seed of 955. Use the R function head() to display the first five generated values in your answer script. 5. Plot the empirical density…arrow_forwardFind binomial probability if: x = 8, n = 10, p = 0.7 x= 3, n=5, p = 0.3 x = 4, n=7, p = 0.6 Quality Control: A factory produces light bulbs with a 2% defect rate. If a random sample of 20 bulbs is tested, what is the probability that exactly 2 bulbs are defective? (hint: p=2% or 0.02; x =2, n=20; use the same logic for the following problems) Marketing Campaign: A marketing company sends out 1,000 promotional emails. The probability of any email being opened is 0.15. What is the probability that exactly 150 emails will be opened? (hint: total emails or n=1000, x =150) Customer Satisfaction: A survey shows that 70% of customers are satisfied with a new product. Out of 10 randomly selected customers, what is the probability that at least 8 are satisfied? (hint: One of the keyword in this question is “at least 8”, it is not “exactly 8”, the correct formula for this should be = 1- (binom.dist(7, 10, 0.7, TRUE)). The part in the princess will give you the probability of seven and less than…arrow_forwardplease answer these questionsarrow_forward
- Selon une économiste d’une société financière, les dépenses moyennes pour « meubles et appareils de maison » ont été moins importantes pour les ménages de la région de Montréal, que celles de la région de Québec. Un échantillon aléatoire de 14 ménages pour la région de Montréal et de 16 ménages pour la région Québec est tiré et donne les données suivantes, en ce qui a trait aux dépenses pour ce secteur d’activité économique. On suppose que les données de chaque population sont distribuées selon une loi normale. Nous sommes intéressé à connaitre si les variances des populations sont égales.a) Faites le test d’hypothèse sur deux variances approprié au seuil de signification de 1 %. Inclure les informations suivantes : i. Hypothèse / Identification des populationsii. Valeur(s) critique(s) de Fiii. Règle de décisioniv. Valeur du rapport Fv. Décision et conclusion b) A partir des résultats obtenus en a), est-ce que l’hypothèse d’égalité des variances pour cette…arrow_forwardAccording to an economist from a financial company, the average expenditures on "furniture and household appliances" have been lower for households in the Montreal area than those in the Quebec region. A random sample of 14 households from the Montreal region and 16 households from the Quebec region was taken, providing the following data regarding expenditures in this economic sector. It is assumed that the data from each population are distributed normally. We are interested in knowing if the variances of the populations are equal. a) Perform the appropriate hypothesis test on two variances at a significance level of 1%. Include the following information: i. Hypothesis / Identification of populations ii. Critical F-value(s) iii. Decision rule iv. F-ratio value v. Decision and conclusion b) Based on the results obtained in a), is the hypothesis of equal variances for this socio-economic characteristic measured in these two populations upheld? c) Based on the results obtained in a),…arrow_forwardA major company in the Montreal area, offering a range of engineering services from project preparation to construction execution, and industrial project management, wants to ensure that the individuals who are responsible for project cost estimation and bid preparation demonstrate a certain uniformity in their estimates. The head of civil engineering and municipal services decided to structure an experimental plan to detect if there could be significant differences in project evaluation. Seven projects were selected, each of which had to be evaluated by each of the two estimators, with the order of the projects submitted being random. The obtained estimates are presented in the table below. a) Complete the table above by calculating: i. The differences (A-B) ii. The sum of the differences iii. The mean of the differences iv. The standard deviation of the differences b) What is the value of the t-statistic? c) What is the critical t-value for this test at a significance level of 1%?…arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman





