Data Science Extra Credit

School

University of Michigan

Course

DATA STRUC

Subject

Statistics

Date

Feb 20, 2024

Type

pdf

Pages

2

Uploaded by EarlGoat1910

Exam 1: Question 12 (5 points were lost)

12.1: 1 point was deducted for this question because my response used terms that a general audience would not understand, specifically “right skew” and “outliers,” to compare cancerous and non-cancerous tumors. A correct response would keep the rest of the information but replace “right skew” with an explanation of how, on average, most of the malignant (cancerous) tumors have somewhat smaller radii than the average non-cancerous tumor radius. I would also replace “outliers” with “extreme values that fall outside the usual spread of either group.”

12.2: 1 point was deducted for this question because I failed to mention information about the perimeter of the cells. A correct response would be that, because most cells are close to circles, their perimeter is an almost exact linear function of the radius (for a circle, perimeter = 2π × radius).

12.3: 3 points were deducted for this question because I failed to mention how the ratio of the 75th to the 25th quantile can be used, did not provide a specific calculation of the ratio, and did not mention the variation in the groups. A correct answer would explain that the 75/25 quantile ratio shows the variation within each group, because it compares the higher cell sizes with the lower cell sizes. For the benign group the ratio is 551/378 ≈ 1.46, and for the malignant group it is 1204/705 ≈ 1.71. These values show that there is slightly more variation in the malignant group, as its ratio is larger.

Exam 2 (7 points were lost)

Question 9: 2 points were deducted for this question because I did not include 0.75 in my response. A correct response would include 0, 0.75, and -0.25, as all of these values are outside the interval [0.1, 0.5] and would therefore result in a rejection of the null hypothesis in a two-sided hypothesis test at the alpha = 0.01 level.
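The arithmetic in Exam 1, Question 12.3 and the interval check in Exam 2, Question 9 can be reproduced with a short sketch. The function names are hypothetical, and the value 0.3 is an illustrative candidate added only for contrast; it is not taken from the exam.

```python
def quantile_ratio(q75: float, q25: float) -> float:
    """Ratio of the 75th to the 25th quantile; a larger ratio means more relative spread."""
    return q75 / q25


def outside_interval(values, low, high):
    """Return the candidate null values that fall outside the closed interval [low, high]."""
    return [v for v in values if v < low or v > high]


# Quantile values quoted in the response to Question 12.3.
benign_ratio = quantile_ratio(551, 378)
malignant_ratio = quantile_ratio(1204, 705)

# Candidate values from Question 9, plus 0.3 as a hypothetical value inside the interval.
rejected = outside_interval([0, 0.75, -0.25, 0.3], 0.1, 0.5)

print(round(benign_ratio, 2), round(malignant_ratio, 2))
print(rejected)
```

Rounded to two decimals, the ratios come out to 1.46 for the benign group and 1.71 for the malignant group. Only the three values outside the interval are flagged for rejection.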
Question 10: 2 points were deducted for this question because I did not correctly identify the result of the snippet of code. The correct answer is that the code shows the joint distribution that would result if the two variables REGION and ANY_65 were independent.

Question 12, 12.3: 2 points were deducted for this question because I failed to describe the relationship between the coefficient for Air Temp (C) (0.888) and the value from the previous problem (0.894). A correct answer would explain that the correlation coefficient (0.888) tells us the strength of the linear relationship, in this case strong and positive, and that the slope (0.894) tells us which line is used to describe that linear relationship, here a line with a positive slope.

Question 13, 13.2: 1 point was deducted for this question because I had the incorrect p-value. The correct p-value should be about 0.05: the test statistic is 10/5 = 2, and by the empirical rule about 95% of values fall within 2 standard deviations of the mean, which leaves about 5% in the two tails.
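Two of the Exam 2 corrections can be illustrated with a minimal sketch. It assumes the ratio 10/5 in Question 13.2 is an estimate divided by its standard error and that the test is two-sided under a normal model. The marginal probabilities for REGION and ANY_65 are hypothetical placeholders, not the exam data, and the function names are illustrative.

```python
import math


def two_sided_p_value(z: float) -> float:
    """Two-sided tail area for a standard normal test statistic z."""
    return math.erfc(abs(z) / math.sqrt(2))


def joint_if_independent(row_probs, col_probs):
    """Joint distribution implied by independence: P(row, col) = P(row) * P(col)."""
    return [[r * c for c in col_probs] for r in row_probs]


# Question 13.2: assumed to be estimate / standard error.
z = 10 / 5
p_value = two_sided_p_value(z)

# Question 10: hypothetical marginals standing in for REGION and ANY_65.
region_probs = [0.2, 0.3, 0.5]
any_65_probs = [0.75, 0.25]
joint = joint_if_independent(region_probs, any_65_probs)

print(round(p_value, 3))
for row in joint:
    print(row)
```

A statistic of 2 gives a two-sided tail area a little under 0.05, which matches the empirical-rule approximation. The table built from the product of the marginals sums to 1, as any joint distribution must.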