Air Quality
As a researcher for the EPA, you have been asked to determine if the air quality in the United States has changed over the past 2 years. You select a random sample of 10 metropolitan areas and find the number of days each year that the areas failed to meet acceptable air quality standards. The data are shown.
Source: The World Almanac and Book of Facts.
Based on the data, answer the following questions.
1. What is the purpose of the study?
2. Are the samples independent or dependent?
3. What hypotheses would you use?
4. What is (are) the critical value(s) that you would use?
5. What statistical test would you use?
6. How many degrees of freedom are there?
7. What is your conclusion?
8. Could an independent means test have been used?
9. Do you think this was a good way to answer the original question?
1.
To find: The purpose of the study.
Explanation of Solution
The purpose of the given study is “to determine if the air quality in the United States has changed over the past 2 years”.
2.
To classify: The samples as independent or dependent.
Answer to Problem 1AC
The samples are dependent.
Explanation of Solution
Independent samples:
If the sample values from one population are not associated with the sample values from the other population, then the two samples are said to be independent samples.
Dependent samples:
If the sample values from one population are associated or matched with the sample values from the other population, then the two samples are said to be dependent samples.
A matched-pair design arises in two situations, which are listed below:
- Subjects are matched in pairs and each treatment is given to one subject in each pair.
- Before and after observations are taken on the same subjects.
Here, the samples are dependent because the same metropolitan areas are measured in both years, so the two sets of readings are related. Thus, it can be concluded that the samples are dependent.
3.
To find: The hypotheses of the study.
Answer to Problem 1AC
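Hypotheses:
Null hypothesis: $H_0: \mu_D = 0$
Alternative hypothesis: $H_1: \mu_D \neq 0$
Here, $\mu_D$ is the mean of the differences in the number of days between the two years.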
Explanation of Solution
Null hypothesis:
The null hypothesis is a statement about a population parameter in which the parameter is set equal to the claimed value. It is denoted by $H_0$.
Alternative hypothesis:
It is complementary to the null hypothesis. That is, it differs from the null hypothesis. The possible symbols used in the alternative hypothesis are <, >, or ≠. It is denoted by $H_1$.
State the null and alternative hypotheses:
Null hypothesis: $H_0: \mu_D = 0$
Alternative hypothesis: $H_1: \mu_D \neq 0$
4.
To find: The critical value.
Answer to Problem 1AC
The critical value is ±2.262.
Explanation of Solution
Calculation:
Degrees of freedom: $\text{d.f.} = n - 1 = 10 - 1 = 9$
Software Procedure:
Step-by-step procedure to obtain the critical value using the MINITAB software:
- Choose Graph > Probability Distribution Plot, choose View Probability, and click OK.
- From Distribution, choose ‘t’ distribution.
- In Degrees of freedom, enter 9.
- Click the Shaded Area tab.
- Choose Probability value and Both Tail for the region of the curve to shade.
- Enter the Probability value as 0.05.
- Click OK.
Output using the MINITAB software is given below:
From the output, the critical value is ±2.262.
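The same critical value can be cross-checked without MINITAB. The following is a minimal sketch using only the Python standard library: it integrates the Student t density numerically (Simpson's rule) and finds the two-tailed cutoff by bisection. The function names `t_pdf`, `t_cdf`, and `t_critical_two_tailed` are illustrative assumptions, not part of the original solution.

```python
import math

def t_pdf(x, df):
    """Density of the Student t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_cdf(x, df, steps=2000):
    """Cumulative probability P(T <= x), via Simpson's rule on [0, |x|] and symmetry."""
    b = abs(x)
    if b == 0:
        return 0.5
    h = b / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(b, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x > 0 else 0.5 - area

def t_critical_two_tailed(alpha, df):
    """Positive critical value c such that P(|T| > c) = alpha, found by bisection."""
    target = 1 - alpha / 2
    lo, hi = 0.0, 50.0
    for _ in range(100):
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# alpha = 0.05 and 9 degrees of freedom, as in the solution above
critical = t_critical_two_tailed(0.05, 9)
print(round(critical, 3))
```

Rounded to three decimals, the result should agree with the tabled value of ±2.262 used above.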
5.
To find: The statistical test.
Answer to Problem 1AC
The t test for dependent samples can be used.
Explanation of Solution
Here, the samples are dependent because the same metropolitan areas are measured in both years, so the two sets of readings are related.
Thus, the t test for dependent samples can be used.
6.
To find: The degrees of freedom.
Answer to Problem 1AC
The degrees of freedom is 9.
Explanation of Solution
Calculation:
Degrees of freedom: $\text{d.f.} = n - 1 = 10 - 1 = 9$
Thus, the degrees of freedom is 9.
7.
To describe: The conclusion.
Answer to Problem 1AC
The conclusion is that there is not enough evidence to support the claim that the air quality in the United States has changed over the past 2 years.
Explanation of Solution
Calculation:
Software Procedure:
Step-by-step procedure to obtain the test value using the MINITAB software:
- Choose Stat > Basic Statistics > 1-Sample t.
- In Samples in Column, enter the column of Difference.
- In Perform hypothesis test, enter the test mean as 0.
- Check Options; enter Confidence level as 95%.
- Choose not equal in alternative.
- Click OK.
Output using the MINITAB software is given below:
From the output, the test value is –1.88.
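The 1-Sample t procedure applied to the Difference column is equivalent to computing $t = \bar{D} / (s_D / \sqrt{n})$, where $\bar{D}$ and $s_D$ are the mean and sample standard deviation of the paired differences. A minimal Python sketch of that calculation follows. The function name `paired_t_statistic` is hypothetical, and the two short lists are illustrative placeholder values only: they are NOT the air quality data from the exercise, which is not reproduced in this text.

```python
import math
import statistics

def paired_t_statistic(first, second, hypothesized_mean=0.0):
    """t statistic for dependent samples: one-sample t on the paired differences."""
    if len(first) != len(second):
        raise ValueError("paired samples must have the same length")
    differences = [a - b for a, b in zip(first, second)]
    n = len(differences)
    mean_diff = statistics.mean(differences)
    sd_diff = statistics.stdev(differences)  # sample standard deviation (n - 1)
    return (mean_diff - hypothesized_mean) / (sd_diff / math.sqrt(n))

# Illustrative placeholder values only, not the data from the exercise
year_1 = [10, 12, 9, 11]
year_2 = [12, 15, 9, 13]
t_value = paired_t_statistic(year_1, year_2)
print(t_value)
```

Feeding the exercise's two columns of yearly counts into the same function should reproduce the test value that MINITAB reports.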
Decision:
Decision rule:
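If $|t| > 2.262$, reject the null hypothesis.
If $|t| \le 2.262$, do not reject the null hypothesis.
Here, the test value lies between the two critical values.
That is, $-2.262 < -1.88 < 2.262$.
Therefore, the null hypothesis is not rejected.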
Thus, the decision is “fail to reject the null hypothesis”.
Hence, there is not enough evidence to support the claim that the air quality in the United States has changed over the past 2 years.
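The two-tailed decision can be expressed as a short check. This is a sketch only: the function name `two_tailed_decision` is an assumption, and the inputs are the test value (–1.88) and critical value (2.262) reported above.

```python
def two_tailed_decision(test_value, critical_value):
    """Reject H0 only when the test value falls outside +/- critical_value."""
    if abs(test_value) > critical_value:
        return "reject the null hypothesis"
    return "fail to reject the null hypothesis"

# Values taken from the solution: test value -1.88, critical value 2.262
decision = two_tailed_decision(-1.88, 2.262)
print(decision)
```

Because |–1.88| = 1.88 does not exceed 2.262, the function returns the same decision reached above.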
8.
To check: Whether an independent means test could have been used.
Answer to Problem 1AC
The independent means test cannot be used.
Explanation of Solution
Here, each metropolitan area had two readings. That is, the samples are related. Thus, the independent means test cannot be used.
9.
To describe: The result.
Explanation of Solution
Answers will vary. One of the possible answers is given below:
There are other measures of air quality in the U.S. that could have been examined to answer the original question.
Chapter 9 Solutions
ALEKS 360 ELEM STATISTICS