Midterm_solutions_Section B

pdf

School

York University *

*We aren’t endorsed by this school

Course

2565

Subject

Mathematics

Date

Jan 9, 2024

Type

pdf

Pages

Uploaded by ColonelThunderSeaUrchin4

MATH 2565 3.0 - Fall 2023 Test questions Question 1: Customers at Longos grocery store in Toronto can pay for purchases with cash, an ATM card, or a credit card. 55% of all customers use cash, and 38% use an ATM card. Careful research has shown that of those paying with cash, 75% use coupons; of those using an ATM card, 35% use coupons; and of those using a credit card, only 10% use coupons. Round answers to 3 decimals. a) If a random selected customer does not use coupons, what is the probability that the customer uses a credit card to pay for purchases? P (credit card | no coupons) = 0 . 141 b) If 3 customers are randomly selected, what is the probably that at least one of them uses coupons? P (at least 1 of 3 use coupons) = 0 . 910 1

Question 2: Consider the following data set x 1 , . . . , x 18 be 177 183 187 188 189 192 194 196 196 202 206 210 211 212 215 222 223 224 Round answers to 3 decimals. a) Using the following given information, n X i =1 x i = 3627 n X i =1 x 2 i = 734403 n X i =1 ( x i − ¯ x ) 2 = 3562 . 5 , find the sample mean and sample variance. ¯ x = 201 . 5 s 2 = 209 . 559 b) Use the IQR to identify outliers. Q 1 = 189 Q 3 = 212 IQR = 23 ( Q 1 − 1 . 5 IQR, Q 3 + 1 . 5 IQR ) = (154 . 5 , 246 . 5) Since all data are within (154.5, 246.5), therefore no outliers. 2

c) Based on the given normal Q-Q plot, does the data look like they are coming from a normal distribution? Why or why not? -2 -1 0 1 2 180 190 200 210 220 Normal Q-Q Plot Theoretical Quantiles Sample Quantiles Since the normal Q-Q plot looks more or less like a straight line, therefore the data seem to follow a normal distribution. d) Assume the data histogram looks bell-shaped. Applying the 68-95-99.7 Rule (Empirical Rule), what range does 95% of the data fall within? According to the 68-95-99.7/empirical rule, approximately 95% of data fall within (¯ x − 2 s, ¯ x + 2 s ) such that (¯ x − 2 s, ¯ x + 2 s ) = (172 . 548 , 230 . 452). 3

Your preview ends here

Eager to read complete document? Join bartleby learn and gain access to the full version

Access to all documents
Unlimited textbook solutions
24/7 expert homework help

Question 3: Assume that online shopping browsing session durations of retail websites is normally distributed with a mean of 21.7 minutes and a standard deviation of 6.8 minutes, based on data from a market research firm. Round answers to 4 decimal places. a) Find the probability that a randomly selected online shopping browsing session has a duration less than 15 minutes. P ( X < 15) = 0 . 1611 b) Find the probability that a randomly selected online shopping browsing session has a duration between 15 to 20 minutes. P (15 < X < 20) = 0 . 2402 c) What durations make up the top 10% of all online shopping browsing durations for the retail websites? P ( Z < 1 . 28) = 0 . 90 1 . 28 = X − 21 . 7 6 . 8 X = 30 . 4 Thus, durations of 30.4 minutes or more make up the top 10%. d) Find the probability that a SRS of 10 online shopping browsing sessions has a mean duration less than 15 minutes. P ( ¯ X < 15) = 0 . 0009 4

Question 4: During the pandemic, it was common that many people worked from home. A multi-national company gathered data from random office locations worldwide to determine if there was a relationship between the number of cases of the virus at that office (Cases) and the percentage of employees who were working in the office at that location (InOffice). Furthermore, they wanted to determine if the number of cases could predict the percentage of employees who showed up to the office for work. Here is some R output from their analysis. > mean(Cases) [1] 25.39623 > mean(InOffice) [1] 25.38755 > cor(Cases,InOffice) [1] -0.7301128 > lm(InOffice ∼ Cases) Call: lm(formula = InOffice ∼ Cases) Coefficients : (Intercept) Cases 34.0818 -0.3423 Answer the following questions based on the R output. a) Which is the independent variable and which is the dependent variable? Write the linear regression equation. Independent variable is number of cases, dependent variable is percentage of employees ˆ y = 34 . 0818 − 0 . 3423 x b) Explain if the y -intercept make sense in this scenario. ˆ β 0 = 34 . 0818 is the percentage of employees in the office when there are no cases. It is possible to have 0 cases and thus the intercept makes sense. 5

c) Calculate r 2 . What does this value mean? r 2 = 0 . 53; 53% of the variability in the percentage of employees in the office can be explained by the number of cases. d) Is the relationship between Cases and InOffice linear? Explain. Not linear because scatterplot not linear and residual plot not randomly scattered. e) Predict the percentage of employees who work at the office when there are 200 cases of the virus using the regression equation. Does this number make sense? Why or why not? ˆ y = − 34%; Does not make sense because percentage should not be negative. Should not extrapolate and should not use the regression equation because the data is not linear. 6

Your preview ends here

Eager to read complete document? Join bartleby learn and gain access to the full version

Access to all documents
Unlimited textbook solutions
24/7 expert homework help

Midterm_solutions_Section B

Related Documents