Instructions: In all exercises, include software results (e.g., from Excel, MegaStat, or Minitab) to support your calculations. State the hypotheses, show how the degrees of freedom are calculated, find the critical value of chi-square from Appendix E or from Excel’s
Pick one Excel data set (A through F) and investigate whether the data could have come from a normal population using α = .01. Use any test you wish, including a histogram, or MegaStat’s
Source: www.wikipedia.org.
Source: Independent project by statistics student Frances Williams. Weighed on an American Scientific Model S/P 120 analytical balance, accurate to 0.0001 gram.
Source: www.wikipedia.org
State the null and alternative hypothesis.
Find the degrees of freedom.
Find the critical value of chi-square from Appendix E or from Excel’s function.
Calculate the chi-square test statistics at 0.01 level of significance.
Interpret the p-value.
Check whether the conclusion is sensitive to the level of significance chosen, identify the cells that contribute to the chi-square test statistic and check for the small expected frequencies.
Draw histogram.
Test to obtain a probability plot with the Anderson-Darling statistic.
Interpret p-values.
Answer to Problem 40CE
The null hypothesis is:
And the alternative hypothesis is:
The degrees of freedom is 1.
The critical-value using EXCEL is 2.705543.
The chi-square test statistics at 0.1 level of significance is 0.0486.
The p-value for the hypothesis test is 0.825518.
There is enough evidence to conclude that the Circulated nickels come from a Normal population.
The conclusion is not sensitive to the level of significance chosen.
The chi-square test statistic has highest chi-square value for zero appointments.
There is no expected frequencies that are too small.
The histogram is:
The probability plot is:
The Anderson-Darling statistics is 0.881.
The p-value is 0.021.
There is enough evidence to conclude that the Circulated nickels come from a Normal population.
Explanation of Solution
Calculation:
Results will vary.
There are 6 data set and pick any of them. The given information is the 31 randomly chosen circulated nickels.
The claim is to test whether the data provide sufficient evidence to conclude that the Circulated nickels come from a Normal population. If the claim is rejected, then the Circulated nickels do not come from a Normal population.
The test hypotheses are given below:
Null hypothesis:
Alternative hypothesis:
Software procedure:
- Step by step procedure to obtain the mean and standard deviations using the MINITAB software.
- Choose Stat > Basic Statistics>Display Descriptive Statistics.
- Under Variables, choose 'Weight'
- Choose Statistics, Select mean and Standard deviation.
- Click OK.
Output obtained from MINITAB software for the data is:
Thus, the mean and the standard deviation of the data is 4.9719 and 0.0662 respectively.
For a normal distribution the first and last class must be open-ended. The upper limit of bin j can be is obtained by the Excel’s function.
Procedure for upper limit using EXCEL:
Step-by-step software procedure to obtain upper limit for first class using EXCEL software is as follows:
- Open an EXCEL file.
- In cell A1, enter the formula “=NORM.INV(1/6,75.38,8.94)”
- Output using EXCEL software is given below:
Thus, the upper limit for first class using EXCEL is 4.908.
Similarly the remaining limits can be obtained as shown below:
Weights |
Under 4.908 |
4.908–4.943 |
4.943–4.972 |
4.972–5.000 |
5.000–5.036 |
5.036 or more |
The expected frequency can be obtained by the following formula:
Where c is the number of bins and n is the sample size.
Substitute
Then the expected frequency for each bin can be obtained as shown in the table:
Weights | Expected Frequency |
Under 4.908 | 5.17 |
4.908–4.943 | 5.17 |
4.943–4.972 | 5.17 |
4.972–5.000 | 5.17 |
5.000–5.036 | 5.17 |
5.036 or more | 5.17 |
Total | 31 |
Frequency:
The frequencies are calculated by using the tally mark and the range of the data is from 4.796 to 5.045.
- Based on the given information, the class intervals are under 4.908, 4.908–4.943, and so on, 5.036 or more.
- Make a tally mark for each value in the corresponding class and continue for all values in the data.
- The number of tally marks in each class represents the frequency, f of that class.
Similarly, the frequency of remaining classes for the emission is given below:
Weights | Tally | Observed Frequency |
Under 4.908 | 4 | |
4.908–4.943 | 4 | |
4.943–4.972 | 6 | |
4.972–5.000 | 4 | |
5.000–5.036 | 8 | |
5.036 or more | 5 |
Let
The chi-square test statistics can be obtained by the formula:
Then the chi-square test statistics can be obtained as shown in the table:
Weights | Frequency | Expected Frequency | ||
Under 4.908 | 4 | 5.17 | –1.17 | 0.265 |
4.908–4.943 | 4 | 5.17 | –1.17 | 0.265 |
4.943–4.972 | 6 | 5.17 | 0.83 | 0.133 |
4.972–5.000 | 4 | 5.17 | –1.17 | 0.265 |
5.000–5.036 | 8 | 5.17 | 2.83 | 1.55 |
5.036 or more | 5 | 5.17 | –0.17 | 0.005 |
Total | 31 | 31 | 0 | 2.483 |
Therefore, the chi-square test statistic is 2.483.
Degrees of freedom:
The degrees of freedom can be obtained as follows:
Where c is the number of classes and m is the number of parameters estimated. Here only 2 parameters are estimated.
Substitute 6 for c and 2 for m.
Thus, the degrees of freedom for the test is 3.
Procedure for p-value using EXCEL:
Step-by-step software procedure to obtain p-value using EXCEL software is as follows:
- Open an EXCEL file.
- In cell A1, enter the formula “=CHISQ.DIST.RT(2.483,3)”
- Output using EXCEL software is given below:
Thus, the p-value using EXCEL is 0.478.
Rejection rule:
If the p-value is less than or equal to the significance level, then reject the null hypothesis
Conclusion:
Here, the p-value is greater than the 0.01 level of significance.
That is,
Therefore, the null hypothesis is not rejected.
Thus, the data provide sufficient evidence to conclude that the Circulated nickels come from a Normal population.
Take
Here, the p-value is less than the 0.05 level of significance.
That is,
Therefore, the null hypothesis is not rejected.
Thus, the data provide sufficient evidence to conclude that the Circulated nickels come from a Normal population.
Thus, the conclusion is same for both the significance levels.
Hence, the conclusion is not sensitive to the level of significance chosen.
The class 5.000-5.036 contribute most to the chi-square test statistic.
Since all
Frequency Histogram:
Software procedure:
- Step by step procedure to draw the relative frequency histogram for mileage for 2000 using MINITAB software.
- Choose Graph > Histogram.
- Choose Simple.
- Click OK.
- In Graph variables, enter the column of 'Weight'.
- In Scale, Choose Y-scale Type as Frequency.
- Click OK.
- Select Edit Scale, Enter 4.8, 4.88, 4.96, 5.04, 5.12 in Positions of ticks.
- In Labels, Enter 4.8, 4.88, 4.96, 5.04, 5.12 41 in Specified.
- Click OK.
Observation:
It is clear from the histogram that the histogram is not symmetric. The left hand tail is little larger than the right hand tail from the maximum frequency value. Thus, there is little negative skewness on the histogram.
Software procedure:
Step by step procedure to obtain the probability plot using the MINITAB software:
- Choose Stat > Basic Statistics > Normality Test.
- In Variable, enter the column of 'Weights'.
- Under Test for Normality, select the column of Anderson-Darling.
- Click OK.
From the probability plot, it can be observed that most of the observations lies near to the straight line. Therefore, the data is from a normal distribution.
From the MINITAB output the Anderson-Darling statistics is 0.881.
From the MINITAB output the p-value is 0.021.
Conclusion:
Here, the p-value is greater than the 0.01 level of significance.
That is,
Therefore, the null hypothesis is not rejected.
Thus, the data provide sufficient evidence to conclude that the Circulated nickels come from a Normal population.
Want to see more full solutions like this?
Chapter 15 Solutions
APPLIED STAT.IN BUS.+ECONOMICS
- What is the most significant independent variable? Smoker Age Blood Pressurearrow_forwardExplain what is the LM test and demonstrate it.arrow_forwardPap smears are a diagnostic test used to detect cervical cancer. Although the test has high specificity, it also has low sensitivity. As a result, women have to be screened often. Using the definitions of sensitivity and specificity, explain why increased frequency of screening is needed when sensitivity is low and specificity is high.arrow_forward
- Which MS Excel Tool shall be used with this problem? Dr. Hess has done work that suggests that emotional arousal affects pupil size. To ascertain if the type of arousal makes a difference, you decided to measure pupil size under three different arousal conditions (Neutral, Pleasant, Aversive). Each participant will look at all the different pictures that differ according to the condition. The pupil size after viewing each photograph is measured in millimeters. * ANOVA Single Factor ANOVA Two Factor Without Replication ANOVA Two Factor With Replication A researcher wants to examine driving performance under three telephone conditions namely: a) No Phone, b) Hand-Held, and c) Hands- Free. If he wants to measure the differences between sample means, we will calculate the variance: * both A and B between treatments within-treatments None of the abovearrow_forwardA random sample of 50 CSCC students are surveyed to determine what percent of our students are first-generation college students. The results showed that 16 of the students identify as first-generation college students. show your work What is the sample proportion of first-generation college students at CSCC? Is this an example of a parameter or a statistic? Explain.arrow_forwardDetermine the test statistic, z0. and P-value.arrow_forward
- Identifying TestsFor the following prompts include:- the name of the test- the parameter being tested, H0 and Ha.Example: What test would you use to determine whether the starting salariesfor statisticians are greater than $80,000?Ans: One-sample t-test for population mean, H0 : µ = $80, 000 vs. Ha : µ >$80,000.a. The 2010 Census found that the average family size in Minnesota was 3.05.What statistical procedure would you use to test whether the average familysize in Minnesota is greater than 3.05 in 2019?b. According to the 2017 American Community Survey, 5.7% of all workers inMinneapolis/St. Paul urbanized area commute to work by public transportation. What statistical procedure would you use to test whether the proportionof workers in Minneapolis/St. Paul urbanized area commute to work by publictransportation in 2019 has changed from than in 2015?c. According to the 2017 American Community Survey, 4.3% of all workers inMinneapolis/St. Paul urbanized area work in construction.…arrow_forwardHelparrow_forwardConsider the parameter(s) of interest in this study. • Label the appropriate parameter (s) of interest and write out what they are in plain English. Use proper statistical notation and an appropriate subscript to denote each population. Then, describe what the parameter(s) represent in terms of the study. • State the question of interest in terms of the parameters (using a mathematical relationship) and explain your reasoning.arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL