Concept explainers
Household Incomes. The following data represent a sample of 14 household incomes ($1000s). Answer the following questions based on this sample.
- a. What is the
median household income for these sample data? - b. According to a previous survey, the median annual household income five years ago was $55,000. Based on the sample data above, estimate the percentage change in the median household income from five years ago to today.
- c. Compute the first and third
quartiles . - d. Provide a five-number summary.
- e. Using the z-score approach, do the data contain any outliers? Does the approach that uses the values of the first and third quartiles and the
interquartile range to detect outliers provide the same results?
a.

Find the median household income for the given sample.
Answer to Problem 68SE
The median household income for the given sample is $52,100.
Explanation of Solution
Calculation:
The data represent the incomes (in $1,000s) for a sample of 14 households. Five years ago the median annual household income was $55,500.
Software procedure:
Step by step procedure to obtain the descriptive statistics using EXCEL is as follows:
- In an EXCEL sheet enter the values of the sample and label it as Sample.
- Go to Data > Data Analysis (in case it is not default, take the Analysis ToolPak from Excel Add Ins) > Descriptive statistics.
- Enter Input Range as $A$2:$A$15, select Columns in Grouped By, tick on Summary statistics.
- Click on OK.
Output using EXCEL is given as follows:
From the EXCEL output, the median is 52.1.
Thus, the median household income for the given sample is $52,100.
b.

Find the percentage change in the median household income from five years ago to today.
Answer to Problem 68SE
The percentage change in the median household income from five years ago to today is decreased by 6.1%.
Explanation of Solution
Calculation:
Five years ago, the median annual household income was $55,500 (55.5).
The median household income for the given sample is $52,100 (52.1).
The percentage change in the median household income from five years ago to today can be obtained as given below:
Thus, the percentage change in the median household income from five years ago to today is decreased by 6.1%.
c.

Find the first and third quartiles.
Answer to Problem 68SE
The first and third quartiles are 50.75, and 52.6, respectively.
Explanation of Solution
Calculation:
First quartile:
The EXCEL function to compute first quartile is
Software Procedure:
Step by step procedure to obtain the first quartile using EXCEL is as follows:
- Open an EXCEL file.
- Enter the data in the column J in cells J1 to J14.
- In a cell A1, enter the formula QUARTILE.EXC (J1:J14,1).
- Click on OK.
Output using EXCEL is given as follows:
From the EXCEL output, the first quartile of the sample data is 50.75.
Third quartile:
The EXCEL function to compute third quartile is
Software Procedure:
Step by step procedure to obtain the third quartile using EXCEL is as follows:
- Open an EXCEL file.
- Enter the data in the column J in cells J1 to J14.
- In a cell A1, enter the formula QUARTILE.EXC (J1:J14,3).
- Click on OK.
Output using EXCEL is given as follows:
From the EXCEL output, the third quartile of the sample data is 52.6.
Thus, the first and third quartiles are 50.75, and 52.6, respectively.
d.

Find the five number summary for the data.
Answer to Problem 68SE
The five-number summary of the data is 46.5, 50.75, 52.1, 52.6 and 64.5.
Explanation of Solution
Calculation:
The five number summary consists the values of minimum, first quartile, second quartile, third quartile, maximum.
From the EXCEL output obtained in Part (a), the maximum, median, and minimum values are 64.5, and 46.5, respectively.
From Part (c), the first and third quartiles of the dataset are 50.75 and 52.6, respectively.
The quartiles of the data set are
Thus, the five-number summary of the dataset is given below:
- Minimum: 46.5,
- First quartile: 50.75,
- Median: 52.1,
- Third quartile: 52.6,
- Maximum: 64.5.
e.

Check for the outliers in the dataset by using the z-score approach.
Check for the outliers in the dataset by using quartiles and interquartile range.
Check whether or not result obtained using z-score approach matches with the result obtained using quartiles and interquartile range.
Answer to Problem 68SE
The outlier using z-score approach is 64.5.
The outliers using quartiles and interquartile range are 49.4, and 64.5.
The result obtained using z-score approach does not matches with the result obtained using quartiles and interquartile range.
Explanation of Solution
Calculation:
From the EXCEL output obtained in Part (a), the mean and standard deviation of the dataset are 52.2 and 4, respectively.
The formula for z-score is given below:
Where,
In a z-score approach, the data points with the z-score above +3 and the data points with the z-score below –3 are considered as outliers.
The z-score corresponding to the data point 49.4 can be obtained as follows:
Substitute
Thus, the z-score corresponding to 49.4 is –0.70.
Similarly, the z-score corresponding to other data points can be obtained as follows:
Data points | z-score |
46.5 | –1.42 |
48.7 | –0.87 |
49.4 | –0.70 |
51.2 | –0.25 |
51.3 | –0.22 |
51.6 | –0.15 |
52.1 | –0.02 |
52.1 | –0.02 |
52.2 | 0.00 |
52.4 | 0.05 |
52.5 | 0.07 |
52.9 | 0.17 |
53.4 | 0.30 |
64.5 | 3.07 |
From the table, it can be seen that the z-score corresponding 64.5 is greater than 3 standard deviations. Thus, it can be considered as outliers an outlier.
The IQR can be obtained as follows:
Substitute
Thus, the interquartile range is 1.85.
The formula for lower limit is obtained as follows:
Here,
Substitute
Thus, the lower limit is 49.825.
The formula for upper limit is obtained as follows:
Substitute
Thus, the upper limit is 55.375.
Outliers:
The outlier is the observational point that is distant from the remaining observational points. In other words outlier is an observation that lies in an abnormal distance from the remaining values.
In the present scenario, the data points that are less than lower limit (49.825) and the data points that are greater than upper limit (55.375) are considered as outliers.
The data point (49.4) is less than 49.825. Thus, it can be considered as an outlier.
The last observation (64.5) is greater than 55.375. Thus, it can also be considered as an outlier.
Hence, the dataset consists of two outliers, 49.4 and 64.5.
Using the z-score approach, there exists only one outlier (64.5). In the second approach there exist two outliers, 49.4 and 64.5.
Thus, the result obtained using z-score approach does not match with the result obtained using quartiles and interquartile range.
Want to see more full solutions like this?
Chapter 3 Solutions
Essentials Of Statistics For Business & Economics
- You find out that the dietary scale you use each day is off by a factor of 2 ounces (over — at least that’s what you say!). The margin of error for your scale was plus or minus 0.5 ounces before you found this out. What’s the margin of error now?arrow_forwardSuppose that Sue and Bill each make a confidence interval out of the same data set, but Sue wants a confidence level of 80 percent compared to Bill’s 90 percent. How do their margins of error compare?arrow_forwardSuppose that you conduct a study twice, and the second time you use four times as many people as you did the first time. How does the change affect your margin of error? (Assume the other components remain constant.)arrow_forward
- Out of a sample of 200 babysitters, 70 percent are girls, and 30 percent are guys. What’s the margin of error for the percentage of female babysitters? Assume 95 percent confidence.What’s the margin of error for the percentage of male babysitters? Assume 95 percent confidence.arrow_forwardYou sample 100 fish in Pond A at the fish hatchery and find that they average 5.5 inches with a standard deviation of 1 inch. Your sample of 100 fish from Pond B has the same mean, but the standard deviation is 2 inches. How do the margins of error compare? (Assume the confidence levels are the same.)arrow_forwardA survey of 1,000 dental patients produces 450 people who floss their teeth adequately. What’s the margin of error for this result? Assume 90 percent confidence.arrow_forward
- The annual aggregate claim amount of an insurer follows a compound Poisson distribution with parameter 1,000. Individual claim amounts follow a Gamma distribution with shape parameter a = 750 and rate parameter λ = 0.25. 1. Generate 20,000 simulated aggregate claim values for the insurer, using a random number generator seed of 955.Display the first five simulated claim values in your answer script using the R function head(). 2. Plot the empirical density function of the simulated aggregate claim values from Question 1, setting the x-axis range from 2,600,000 to 3,300,000 and the y-axis range from 0 to 0.0000045. 3. Suggest a suitable distribution, including its parameters, that approximates the simulated aggregate claim values from Question 1. 4. Generate 20,000 values from your suggested distribution in Question 3 using a random number generator seed of 955. Use the R function head() to display the first five generated values in your answer script. 5. Plot the empirical density…arrow_forwardFind binomial probability if: x = 8, n = 10, p = 0.7 x= 3, n=5, p = 0.3 x = 4, n=7, p = 0.6 Quality Control: A factory produces light bulbs with a 2% defect rate. If a random sample of 20 bulbs is tested, what is the probability that exactly 2 bulbs are defective? (hint: p=2% or 0.02; x =2, n=20; use the same logic for the following problems) Marketing Campaign: A marketing company sends out 1,000 promotional emails. The probability of any email being opened is 0.15. What is the probability that exactly 150 emails will be opened? (hint: total emails or n=1000, x =150) Customer Satisfaction: A survey shows that 70% of customers are satisfied with a new product. Out of 10 randomly selected customers, what is the probability that at least 8 are satisfied? (hint: One of the keyword in this question is “at least 8”, it is not “exactly 8”, the correct formula for this should be = 1- (binom.dist(7, 10, 0.7, TRUE)). The part in the princess will give you the probability of seven and less than…arrow_forwardplease answer these questionsarrow_forward
- Selon une économiste d’une société financière, les dépenses moyennes pour « meubles et appareils de maison » ont été moins importantes pour les ménages de la région de Montréal, que celles de la région de Québec. Un échantillon aléatoire de 14 ménages pour la région de Montréal et de 16 ménages pour la région Québec est tiré et donne les données suivantes, en ce qui a trait aux dépenses pour ce secteur d’activité économique. On suppose que les données de chaque population sont distribuées selon une loi normale. Nous sommes intéressé à connaitre si les variances des populations sont égales.a) Faites le test d’hypothèse sur deux variances approprié au seuil de signification de 1 %. Inclure les informations suivantes : i. Hypothèse / Identification des populationsii. Valeur(s) critique(s) de Fiii. Règle de décisioniv. Valeur du rapport Fv. Décision et conclusion b) A partir des résultats obtenus en a), est-ce que l’hypothèse d’égalité des variances pour cette…arrow_forwardAccording to an economist from a financial company, the average expenditures on "furniture and household appliances" have been lower for households in the Montreal area than those in the Quebec region. A random sample of 14 households from the Montreal region and 16 households from the Quebec region was taken, providing the following data regarding expenditures in this economic sector. It is assumed that the data from each population are distributed normally. We are interested in knowing if the variances of the populations are equal. a) Perform the appropriate hypothesis test on two variances at a significance level of 1%. Include the following information: i. Hypothesis / Identification of populations ii. Critical F-value(s) iii. Decision rule iv. F-ratio value v. Decision and conclusion b) Based on the results obtained in a), is the hypothesis of equal variances for this socio-economic characteristic measured in these two populations upheld? c) Based on the results obtained in a),…arrow_forwardA major company in the Montreal area, offering a range of engineering services from project preparation to construction execution, and industrial project management, wants to ensure that the individuals who are responsible for project cost estimation and bid preparation demonstrate a certain uniformity in their estimates. The head of civil engineering and municipal services decided to structure an experimental plan to detect if there could be significant differences in project evaluation. Seven projects were selected, each of which had to be evaluated by each of the two estimators, with the order of the projects submitted being random. The obtained estimates are presented in the table below. a) Complete the table above by calculating: i. The differences (A-B) ii. The sum of the differences iii. The mean of the differences iv. The standard deviation of the differences b) What is the value of the t-statistic? c) What is the critical t-value for this test at a significance level of 1%?…arrow_forward
- Big Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL


