Predicting software defects. Refer to the PROMISE Software Engineering Repository data on 498 modules of software code written in “C” language for a NASA spacecraft instrument, saved in the file. (See Exercise 3.132, p. 185). Recall that the software code in each module was evaluated for defects; 49 were classified as “true” (i.e., module has defective code), and 449 were classified as “false” (i.e., module has correct code). Consider these to be independent random samples of software code modules. Researchers predicted the defect status of each module using the simple algorithm, “If number of lines of code in the module exceeds 50, predict the module to have a defect.” The accompanying SPSS printout shows the number of modules in each of the two samples that were predicted to have defects (PRED_LOC = _“yes”) and predicted to have no defects (PRED_LOC = _“no”). Now, define the accuracy rate of the algorithm as the proportion of modules that were correctly predicted. Compare the accuracy rate of the algorithm when applied to modules with defective code with the accuracy rate of the algorithm when applied to modules with correct code. Use a 99% confidence interval.
DEFECT PRED_LOC Crosstabulation
Count
PRED_LOC | Total | |||
no | yes | |||
DEFECT | false | 400 | 49 | 449 |
true | 29 | 20 | 49 | |
total | 429 | 69 | 496 |
Want to see the full answer?
Check out a sample textbook solutionChapter 8 Solutions
Statistics for Business and Economics (13th Edition)
- Johnson Filtration, Inc. provides maintenance service for water-filtration systems. Suppose that in addition to information on the number of months since the machine was serviced and whether a mechanical or an electrical repair was necessary, the managers obtained a list showing which repairperson performed the service. Repair Time(hours) Months SinceLast Service Type ofRepair Repairperson 2.9 2 electrical Dave Newton 3.0 6 mechanical Dave Newton 4.8 8 electrical Bob Jones 1.8 3 mechanical Dave Newton 2.9 2 electrical Dave Newton 4.9 7 electrical Bob Jones 4.2 9 mechanical Bob Jones 4.8 8 mechanical Bob Jones 4.4 4 electrical Bob Jones 4.5 6 electrical Dave Newton Ignore for now the months since the last maintenance service (x1 ) and the repairperson who performed the service. Develop the estimated simple linear regression equation to predict the repair time (y) given the…arrow_forwardI mainly need help with part D. I'm not sure how to write the code to use in R to get the values needed. Question 1. Are the outcomes of hospital care different on weekends than weekdays? In a random sample of 500 patients who experienced severe medical complications after admission to acute care wards in three U.S. states from 1999 and 2001, 121 had been admitted on a weekend and 379 had been admitted on a weekday. This compares with a large population of people at risk for such complications in which 13.9% are admitted on weekends and 86.1% are admitted on weekdays.a) In the 500 sampled patients with severe complications, what fraction had been admitted on weekends? Is this higher or lower than the fraction of all at-risk patients admitted on weekends? b) Name two statistical methods that could be used to test whether the probability of severe complications in at-risk patients admitted to hospitals differs betweenweekend and weekday. State the advantages and disadvantages of both.…arrow_forwarddo a scatterplot that can visualize how the confirmed cases of COVID-19 (case rate per 100,000) affect economic development (GDP per capita in Q2 2020) Political affiliation: Swing States State Case Rate per 100,000 GDP Per Capita in Q2 2020 ($) Arizona 5,079 48,105 Florida 4,886 47,802 Georgia 4,766 54,696 Michigan 4,271 47,612 Nevada 5,541 50,783 North Carolina 3,804 52,133 Pennsylvania 3,280 56,540 Wisconsin 7,587 53,934 Political affiliation: Solid Red States State Case Rate per 100,000 GDP Per Capita in Q2 2020 ($) Arkansas 5,671 40,033 Indiana 5,703 51,102 Kansas 5,780 55,423 Nebraska 7,248 61,875 Kentucky 4,490 43,396 Oklahoma 5,224 43,736 Tennessee 5,917 48,790 Wyoming 6,286 57,421 Political affiliation: Solid Blue States State Case Rate per 100,000 GDP Per Capita in Q2 2020 ($) California 3,392 73,219 Colorado 4,575 63,384 Connecticut 3,575 73,685 Maryland 3,596 65,933 Massachusetts 3,730 79,296 New Jersey 4,131…arrow_forward
- Dr. Moas is interested in examining how environmental cues affect memory. She asks 15 students to study 10 obscure words for a vocabulary exam. All students take the final vocabulary exam in Classroom A. She randomly assigns participants to one of three locations to study for the exam: the same classroom (Classroom A), a different classroom (Classroom B), or a completely separate location, the school gym. Then, she records the number of vocabulary words that each student correctly defines on the exam. The data is as follows: Participant Study Location Vocabulary Words Answered Correctly 1 Same Classroom 9 2 Same Classroom 8 3 Same Classroom 10 4 Same Classroom 8 5 Same Classroom 9 6 Different Classroom 9 7 Different Classroom 8 8 Different Classroom 7 9 Different Classroom 8 10 Different Classroom 7 11 School Gym 6 12 School Gym 7 13 School Gym 5 14 School Gym 6…arrow_forwardDr. Moas is interested in examining how environmental cues affect memory. She asks 15 students to study 10 obscure words for a vocabulary exam. All students take the final vocabulary exam in Classroom A. She randomly assigns participants to one of three locations to study for the exam: the same classroom (Classroom A), a different classroom (Classroom B), or a completely separate location, the school gym. Then, she records the number of vocabulary words that each student correctly defines on the exam. The data is as follows: Participant Study Location Vocabulary Words Answered Correctly 1 Same Classroom 9 2 Same Classroom 8 3 Same Classroom 10 4 Same Classroom 8 5 Same Classroom 9 6 Different Classroom 9 7 Different Classroom 8 8 Different Classroom 7 9 Different Classroom 8 10 Different Classroom 7 11 School Gym 6 12 School Gym 7 13 School Gym 5 14 School Gym 6…arrow_forwardA researcher wondered if attainment within six years among students who receive grants as sart of their educational funding (Group 1) was lower than attainment within six years among students who did not receive grants as part of their educational funding (Group 2). Atainment is defined as whether the student earned the degree or certificate that heishe set out to eam upon enrolment. Complete parts (a) through (c) below. A. 2 1 Cannot be determined (c) In part (b), we learned that two groups (students who receive grants and students who do not receive grants) are being compared. In addition, the sampling method is independent. State the null and altemative hypotheses for this test Hg.arrow_forward
- (All answers were generated using 1,000 trials and native Excel functionality.) Allegiant Airlines is considering an overbooking policy for one of its flights. The airplane has 50 seats, but Allegiant is considering accepting more reservations than seats because sometimes passengers do not show up for their flights, resulting in empty seats. The PassengerAppearance worksheet in the file Overbooking contains data on 1,000 passengers showing whether or not they showed up for their respective flights. Click on the datafile logo to reference the data. In addition, Allegiant has conducted a field experiment to gauge the demand for reservations for the current flight. During this experiment, they did not limit the number of reservations for the flight to observe the uncensored demand. The following table summarizes the result of the field experiment. No. of Reservations Demanded Probability 48 0.05 49 0.05 50 0.15 51 0.30 52 0.25 53 0.10 54 0.10 Allegiant receives a…arrow_forwardJensen Tire & Auto is in the process of deciding whether to purchase a maintenance contract for its new computerized wheel alignment and balancing machine. Managers feel that maintenance expense is related to usage of the machine, and they have collected the following information on weekly usage of the machine (in hours) and annual maintenance expense (in hundreds of dollars) from 10 other companies that own the machine.Company Weekly Usage(hours) Annual maintenance Expense(hundreds of dollars)A 13 17.0B 10 22.0C 20 30.0D 28 37.0E 32 47.0F 17…arrow_forwardNo 5arrow_forward
- Researchers conducted a study to find out if there is a difference in the use of online-learning by different age groups. Randomly selected participants were divided into two age groups. In the 16 to 29-year-old group, 7% of the 628 surveyed use online-learning, while 11% of the 2,309 participants 30 years old and older use online learning. (Set 1 for 16-29 year old group; Set 2 for 30 years old and older). What is the alternative hypothesis test?arrow_forwardFloyd’s Bumpers has distribution centers in Lafayette, Indiana; Charlotte, North Carolina; Los Angeles, California; Dallas, Texas; and Pittsburgh, Pennsylvania. Each distribution center carries all products sold. Floyd’s customers are auto repair shops and larger auto parts retail stores. You are asked to perform an analysis of the customer assignments to determine which of Floyd’s customers should be assigned to each distribution center. The rule for assigning customers to distribution centers is simple: A customer should be assigned to the closest center. The worksheet Floyds in the provided datafile contains the distance from each of Floyd’s 1,029 customers to each of the five distribution centers. Your task is to build a list that tells which distribution center should serve each customer. The following functions will be helpful: =MIN(array) The MIN function returns the smallest value in a set of numbers. For example, if the range A1:A3 contains the values 6, 25, and 38, then the…arrow_forwardThe department of code enforcement of a country government issues permits to general contractors to work on residential projects. For each permit issued, the department inspects the result of the project and gives a "pass" or "fail" rating. A failed project must be re-inspected until it receives a pass rating. The department had been frustrated by the high cost of re-inspection rate and decided to publish the inspection records of all contractors on the web. It was hoped that public access to the records would lower the re-inspection rate. A year after the web access was made public, two samples of records were randomly selected. One sample was selected from the pool of records before web publication and one after. The proportion of projects that passed on the first inspected was noted for each sample. The results are summarized below. No Public Web Access nį = 500, ĝ1 = 0.67 Public Web Access n2 = 100, p2 = 0.80 A test, at a 0.10 level of significance, is to be conducted as to whether…arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillMathematics For Machine TechnologyAdvanced MathISBN:9781337798310Author:Peterson, John.Publisher:Cengage Learning,