Predicting software defects. Refer to the PROMISE Software Engineering Repository data on 498 modules of software code written in “C” language for a NASA spacecraft instrument, saved in the file. (See Exercise 3.132, p. 185). Recall that the software code in each module was evaluated for defects; 49 were classified as “true” (i.e., module has defective code), and 449 were classified as “false” (i.e., module has correct code). Consider these to be independent random samples of software code modules. Researchers predicted the defect status of each module using the simple algorithm, “If number of lines of code in the module exceeds 50, predict the module to have a defect.” The accompanying SPSS printout shows the number of modules in each of the two samples that were predicted to have defects (PRED_LOC = _“yes”) and predicted to have no defects (PRED_LOC = _“no”). Now, define the accuracy rate of the algorithm as the proportion of modules that were correctly predicted. Compare the accuracy rate of the algorithm when applied to modules with defective code with the accuracy rate of the algorithm when applied to modules with correct code. Use a 99% confidence interval.
DEFECT PRED_LOC Crosstabulation
Count
PRED_LOC | Total | |||
no | yes | |||
DEFECT | false | 400 | 49 | 449 |
true | 29 | 20 | 49 | |
total | 429 | 69 | 496 |
Want to see the full answer?
Check out a sample textbook solutionChapter 8 Solutions
Statistics for Business and Economics (13th Edition)
- I mainly need help with part D. I'm not sure how to write the code to use in R to get the values needed. Question 1. Are the outcomes of hospital care different on weekends than weekdays? In a random sample of 500 patients who experienced severe medical complications after admission to acute care wards in three U.S. states from 1999 and 2001, 121 had been admitted on a weekend and 379 had been admitted on a weekday. This compares with a large population of people at risk for such complications in which 13.9% are admitted on weekends and 86.1% are admitted on weekdays.a) In the 500 sampled patients with severe complications, what fraction had been admitted on weekends? Is this higher or lower than the fraction of all at-risk patients admitted on weekends? b) Name two statistical methods that could be used to test whether the probability of severe complications in at-risk patients admitted to hospitals differs betweenweekend and weekday. State the advantages and disadvantages of both.…arrow_forwardAnswer with no programming please.arrow_forwardDr. Moas is interested in examining how environmental cues affect memory. She asks 15 students to study 10 obscure words for a vocabulary exam. All students take the final vocabulary exam in Classroom A. She randomly assigns participants to one of three locations to study for the exam: the same classroom (Classroom A), a different classroom (Classroom B), or a completely separate location, the school gym. Then, she records the number of vocabulary words that each student correctly defines on the exam. The data is as follows: Participant Study Location Vocabulary Words Answered Correctly 1 Same Classroom 9 2 Same Classroom 8 3 Same Classroom 10 4 Same Classroom 8 5 Same Classroom 9 6 Different Classroom 9 7 Different Classroom 8 8 Different Classroom 7 9 Different Classroom 8 10 Different Classroom 7 11 School Gym 6 12 School Gym 7 13 School Gym 5 14 School Gym 6…arrow_forward
- Dr. Moas is interested in examining how environmental cues affect memory. She asks 15 students to study 10 obscure words for a vocabulary exam. All students take the final vocabulary exam in Classroom A. She randomly assigns participants to one of three locations to study for the exam: the same classroom (Classroom A), a different classroom (Classroom B), or a completely separate location, the school gym. Then, she records the number of vocabulary words that each student correctly defines on the exam. The data is as follows: Participant Study Location Vocabulary Words Answered Correctly 1 Same Classroom 9 2 Same Classroom 8 3 Same Classroom 10 4 Same Classroom 8 5 Same Classroom 9 6 Different Classroom 9 7 Different Classroom 8 8 Different Classroom 7 9 Different Classroom 8 10 Different Classroom 7 11 School Gym 6 12 School Gym 7 13 School Gym 5 14 School Gym 6…arrow_forwardA researcher wondered if attainment within six years among students who receive grants as sart of their educational funding (Group 1) was lower than attainment within six years among students who did not receive grants as part of their educational funding (Group 2). Atainment is defined as whether the student earned the degree or certificate that heishe set out to eam upon enrolment. Complete parts (a) through (c) below. A. 2 1 Cannot be determined (c) In part (b), we learned that two groups (students who receive grants and students who do not receive grants) are being compared. In addition, the sampling method is independent. State the null and altemative hypotheses for this test Hg.arrow_forwardAt a stop sign, some drivers come to a full stop, some come to a rolling stop (not a full stop, but slow down), and some do not stop at all. Test if there is an association between gender and type of stop (full, rolling, or no stop). Data is collected by standing a few feet from a stop sign and taking note of the type of stop and the gender of the driver. Below is a contingency table summarizing the data that was collected. Do the data provide enough evidence to show that there is an association between gender and type of stop (full, rolling, or no stop)? Test at the 3% level and round expected values to one decimal place. Type of Stop Male (Observed) Male (Expected Value) Female (Observed) Female (Expected Value) Row Total Full Stop 11 21 32 Rolling Stop 15 15 30 No Stop 9 8 17 Column Total 35 44 79 H: HypothesesHo:Ho:Select an answer Gender male female Male and Select an answer Type of Stop male female Female are Select an answer independent dependent…arrow_forward
- Assume that in your linguistics class you have earned the test scores 90, 71, 57, and 68; and only one test remains. If you need a mean score of 70 to earn a C, then what minimum score must you obtain on the last test?arrow_forwardI am currently sick with COVID so my brain is not working but this question is kicking my butt help pleasearrow_forwardCan you help me please?arrow_forward
- Jensen Tire & Auto is in the process of deciding whether to purchase a maintenance contract for its new computerized wheel alignment and balancing machine. Managers feel that maintenance expense is related to usage of the machine, and they have collected the following information on weekly usage of the machine (in hours) and annual maintenance expense (in hundreds of dollars) from 10 other companies that own the machine.Company Weekly Usage(hours) Annual maintenance Expense(hundreds of dollars)A 13 17.0B 10 22.0C 20 30.0D 28 37.0E 32 47.0F 17…arrow_forwardThe production department of Celltronics International wants to explore the relationship between the number of employees who assemble a subassembly and the number produced. As an experiment, 2 employees were assigned to assemble the subassemblies. They produced 11 during a one-hour period. Then 4 employees assembled them. They produced 18 during a one-hour period. The complete set of paired observations follows. Number ofAssemblers One-HourProduction (units) 2 11 4 18 1 5 5 23 3 18 The dependent variable is production; that is, it is assumed that different levels of production result from a different number of employees. Click here for the Excel Data File Draw a scatter diagram. On the graph below, use the point tool to plot the point corresponding to the first Number of Assemblers and her Production (No of Assemblers1). Repeat the process for the remainder of the sample (No of Assemblers2,…arrow_forwardIs high school GPA a good predictor of college GPA? Let's explore it with some real data. In RStudio, run the following code to install and/or library the package "openintro". 1. install.packages ("openintro") # don't do this again if you already did this! 2. library(openintro) 3. satgpa Delete the install line of code if you are in an RMD file so that it doesn't install every time you knit. The last line of code will access the dataset of that name. The dataset 'satgpa' gives information about test scores and GPA in high school and college for students at an unnamed college. Use the data to build a model to predict first year college GPA ("fy_gpa") from high school GPA ("hs_gpa"). a. Make a scatterplot of "fy_gpa" (y-axis) vs "hs_gpa" (x-axis). Which plot is the scatterplot? Graph A Graph B Graph C Graph D O A C₁- fy_gpa hs_gpa b. Does the relationship appear roughly linear? O No, all the points do not fall on a straight line. O Yes, there is no obvious curvature in the graph O No,…arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillMathematics For Machine TechnologyAdvanced MathISBN:9781337798310Author:Peterson, John.Publisher:Cengage Learning,