
For Problems 7-21, please provide the following information.
(a) What is the level of significance? Stale the null and alternate hypotheses, (b) Check Requirements What sampling distribution will you use? Do you think the sample size is sufficiently large? Explain Compute the value of the sample test statistic and corresponding z value. (c) Find the P-value of the test statistic Sketch the sampling distribution and show the area corresponding to the P-value.
(d) Based on sour answers in parts (a) to (c), will you reject or fail to reject the null hypothesis? Are the data statistic ally significant at level a?
(c) Interpret your conclusion in the context of the application.
Focus Problem: Benford's Law Again, suppose you are the auditor for a very large corporation. The revenue file contains millions of numbers in a large computer data bank (see Problem 7). You draw a random sample of n = 228 numbers from this file, and r = 92 have a first nonzero digit of 1.
Let p represent the population proportion of all numbers in the computer file that have a leading digit of 1.
i.Test the claim that p is more than 0.301. Use
ii.If p is in fact larger than 0.301, it would seem there are too many numbers in the file with leading 1s. Could this indicate that the books have been “cooked" by artificially lowering numbers in the file? Comment from the point of view of the Internal Revenue Service, Comment from the perspective of the Federal Bureau of Investigation as it looks for "profit skimming" by unscrupulous employees.
iii.Comment on the following statement: "If we reject the null hypothesis at level of significance
(i)
(a)

The level of significance, null and alternative hypothesis.
Answer to Problem 8P
Solution: The level of significance is
Explanation of Solution
The level of significance is defined as the probability of rejecting the null hypothesis when it is true, it is denoted by
Null hypothesis
Alternative hypothesis
(b)

To find: The sampling distribution that should be used and compute the z value of the sample test statistic.
Answer to Problem 8P
Solution: The sampling distribution
Explanation of Solution
Calculation:
The
The standardized sample test statistic for
(c)

To find: The P-value of the test statistic and sketch the sampling distribution showing the area corresponding to the P-value.
Answer to Problem 8P
Solution: The P-value of the test statistic is 0.0004.
Explanation of Solution
Calculation:
We have z = 3.37
Using Table 3 from the Appendix to find the specified area:
Thus P- value is 0.0004.
Graph:
To draw the required graphs using the Minitab, follow the below instructions:
Step 1: Go to the Minitab software.
Step 2: Go to Graph > Probability distribution plot > View probability.
Step 3: Select ‘Normal’ and enter Mean 0 and Standard deviation 1.
Step 4: Click on the Shaded area > X value.
Step 5: Enter X-value as 3.37 and select ‘Right tail’.
Step 6: Click on OK.
The obtained distribution graph is:
(d)

Whether we reject or fail to reject the null hypothesisand whether the data is statistically significant for a level of significance of 0.01.
Answer to Problem 8P
Solution: The P-value
Explanation of Solution
The P-value of 0.0004 is less than the level of significance (
(e)

The interpretation for the conclusion.
Answer to Problem 8P
Solution: There is sufficient evidence to conclude that population proportion of numbers with leading “1” in the revenue file is more than the probability 0.301.
Explanation of Solution
The P-value of 0.0004 is less than the level of significance (
(ii)

To explain: Whether it is suspect that there are too many numbers in the data file with leading 1's.
Answer to Problem 8P
Solution: Yes. The revenue data file seems to be too many entries with leading digit 1.
Explanation of Solution
There are too many numbers in the data file with leading 1's. So, we cannot say that it is an indication of the books have been “cooked” by artificially lowering numbers in the file. From the viewpoint of the Internal Revenue Service and the Federal Bureau of Investigation as it looks for “profit skimming”, it may be true or false because there are too many numbers in the data file with leading 1’s.
(iii)

To explain: Whether it recommends further investigation before accusing the company of fraud.
Answer to Problem 8P
Solution: Our data lead us to reject the null hypothesis, more investigation is merited.
Explanation of Solution
Since, we reject the null hypothesis
Want to see more full solutions like this?
Chapter 9 Solutions
EBK UNDERSTANDING BASIC STATISTICS
- The following ordered data list shows the data speeds for cell phones used by a telephone company at an airport: A. Calculate the Measures of Central Tendency from the ungrouped data list. B. Group the data in an appropriate frequency table. C. Calculate the Measures of Central Tendency using the table in point B. 0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2 6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6 11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1 15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4 23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8arrow_forwardII Consider the following data matrix X: X1 X2 0.5 0.4 0.2 0.5 0.5 0.5 10.3 10 10.1 10.4 10.1 10.5 What will the resulting clusters be when using the k-Means method with k = 2. In your own words, explain why this result is indeed expected, i.e. why this clustering minimises the ESS map.arrow_forwardwhy the answer is 3 and 10?arrow_forward
- PS 9 Two films are shown on screen A and screen B at a cinema each evening. The numbers of people viewing the films on 12 consecutive evenings are shown in the back-to-back stem-and-leaf diagram. Screen A (12) Screen B (12) 8 037 34 7 6 4 0 534 74 1645678 92 71689 Key: 116|4 represents 61 viewers for A and 64 viewers for B A second stem-and-leaf diagram (with rows of the same width as the previous diagram) is drawn showing the total number of people viewing films at the cinema on each of these 12 evenings. Find the least and greatest possible number of rows that this second diagram could have. TIP On the evening when 30 people viewed films on screen A, there could have been as few as 37 or as many as 79 people viewing films on screen B.arrow_forwardQ.2.4 There are twelve (12) teams participating in a pub quiz. What is the probability of correctly predicting the top three teams at the end of the competition, in the correct order? Give your final answer as a fraction in its simplest form.arrow_forwardThe table below indicates the number of years of experience of a sample of employees who work on a particular production line and the corresponding number of units of a good that each employee produced last month. Years of Experience (x) Number of Goods (y) 11 63 5 57 1 48 4 54 5 45 3 51 Q.1.1 By completing the table below and then applying the relevant formulae, determine the line of best fit for this bivariate data set. Do NOT change the units for the variables. X y X2 xy Ex= Ey= EX2 EXY= Q.1.2 Estimate the number of units of the good that would have been produced last month by an employee with 8 years of experience. Q.1.3 Using your calculator, determine the coefficient of correlation for the data set. Interpret your answer. Q.1.4 Compute the coefficient of determination for the data set. Interpret your answer.arrow_forward
- Can you answer this question for mearrow_forwardTechniques QUAT6221 2025 PT B... TM Tabudi Maphoru Activities Assessments Class Progress lIE Library • Help v The table below shows the prices (R) and quantities (kg) of rice, meat and potatoes items bought during 2013 and 2014: 2013 2014 P1Qo PoQo Q1Po P1Q1 Price Ро Quantity Qo Price P1 Quantity Q1 Rice 7 80 6 70 480 560 490 420 Meat 30 50 35 60 1 750 1 500 1 800 2 100 Potatoes 3 100 3 100 300 300 300 300 TOTAL 40 230 44 230 2 530 2 360 2 590 2 820 Instructions: 1 Corall dawn to tha bottom of thir ceraan urina se se tha haca nariad in archerca antarand cubmit Q Search ENG US 口X 2025/05arrow_forwardThe table below indicates the number of years of experience of a sample of employees who work on a particular production line and the corresponding number of units of a good that each employee produced last month. Years of Experience (x) Number of Goods (y) 11 63 5 57 1 48 4 54 45 3 51 Q.1.1 By completing the table below and then applying the relevant formulae, determine the line of best fit for this bivariate data set. Do NOT change the units for the variables. X y X2 xy Ex= Ey= EX2 EXY= Q.1.2 Estimate the number of units of the good that would have been produced last month by an employee with 8 years of experience. Q.1.3 Using your calculator, determine the coefficient of correlation for the data set. Interpret your answer. Q.1.4 Compute the coefficient of determination for the data set. Interpret your answer.arrow_forward
- Q.3.2 A sample of consumers was asked to name their favourite fruit. The results regarding the popularity of the different fruits are given in the following table. Type of Fruit Number of Consumers Banana 25 Apple 20 Orange 5 TOTAL 50 Draw a bar chart to graphically illustrate the results given in the table.arrow_forwardQ.2.3 The probability that a randomly selected employee of Company Z is female is 0.75. The probability that an employee of the same company works in the Production department, given that the employee is female, is 0.25. What is the probability that a randomly selected employee of the company will be female and will work in the Production department? Q.2.4 There are twelve (12) teams participating in a pub quiz. What is the probability of correctly predicting the top three teams at the end of the competition, in the correct order? Give your final answer as a fraction in its simplest form.arrow_forwardQ.2.1 A bag contains 13 red and 9 green marbles. You are asked to select two (2) marbles from the bag. The first marble selected will not be placed back into the bag. Q.2.1.1 Construct a probability tree to indicate the various possible outcomes and their probabilities (as fractions). Q.2.1.2 What is the probability that the two selected marbles will be the same colour? Q.2.2 The following contingency table gives the results of a sample survey of South African male and female respondents with regard to their preferred brand of sports watch: PREFERRED BRAND OF SPORTS WATCH Samsung Apple Garmin TOTAL No. of Females 30 100 40 170 No. of Males 75 125 80 280 TOTAL 105 225 120 450 Q.2.2.1 What is the probability of randomly selecting a respondent from the sample who prefers Garmin? Q.2.2.2 What is the probability of randomly selecting a respondent from the sample who is not female? Q.2.2.3 What is the probability of randomly…arrow_forward
- College Algebra (MindTap Course List)AlgebraISBN:9781305652231Author:R. David Gustafson, Jeff HughesPublisher:Cengage LearningGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw Hill

