2023.11.11. Assignment No. 4 - Answer Key

pdf

School

University of Victoria *

*We aren’t endorsed by this school

Course

245

Subject

Economics

Date

Jan 9, 2024

Type

pdf

Pages

7

Uploaded by BailiffBeaverPerson731

Report
ECON 245: Descriptive Statistics and Probability Dr. Paola Beneras P. Due: Nov. 21, 2023 Assignment No. 4 - Answer Key Instructions: Ensure your assignment is legible and that you staple your assignment before handing it in. Show your work. Question 1. Education and Household Income. The following table shows the number of households (in 1000s) and the household income by the highest level of education for the head of household. Only households in which the head has a high school diploma or more are included. Household Income Highest Level of Education Under $ 25,000 $ 25,000-49,999 $ 50,000-99,999 $ 100,000+ High School Degree 9880 9970 9441 3482 Bachelor’s Degree 2484 4164 7666 7817 Master’s Degree 685 1205 3019 4094 Doctoral Degree 79 160 422 1076 a. Develop a joint probability table. Household Income Highest Level of Education Under $ 25,000 $ 25,000-49,999 $ 50,000-99,999 $ 100,000+ Total High School Degree 0.1505 1519 0.1438 0.0530 0.4993 Bachelor’s Degree 0.0378 0.0634 0.1168 0.1191 0.3371 Master’s Degree 0.0104 0.0184 0.0460 0.0624 0.1371 Doctoral Degree 0.0012 0.0024 0.0064 0.0164 0.0265 Total 0.2000 0.2361 0.3130 0.2509 1.0000 b. What is the probability that the head of one of those households has a master’s degree or more education? Identify what kind of probability you are calculating. - This is the sum of two marginal probabilities: P (master’s degree or more education) = 0 . 1371 + 0 . 0265 = 0 . 1636 c. What is the probability that a household headed by someone with a high school diploma earns $ 100,000 or more? Identify what kind of probability you are calculating. - This is a conditional probability: P (100 k or more | HS) = P (100 k or more HS) P (HS) = 0 . 0530 0 . 4993 = 0 . 1016 d. What is the probability had one of these households has an income below $ 25,000? Identify what kind of probability you are calculating. - This is a marginal probability: P (under 25) = 0 . 2000 1
e. What is the probability that a household headed with someone with a bachelor’s degree earns less than $ 25,000? Identify what kind of probability you are calculating. - This is a conditional probability: P (under 25 | BD) = P (under 25 BD) P (BD) = 0 . 0378 0 . 3371 = 0 . 1121 f. Is household income independent of education level? - No. Household income is not independent of education level. People who have a HS degree have a lower probability of having a higher household income: P (100 k or more | HS) = 0 . 1016 ̸ = P (100 k or more) = 0 . 2509 Question 2. Finding Gold in the Artic. A mining company is in negotiations for a contract in Northern Canada. Preliminary geologic studies have computed the following probabilities for the site: P (high-grade gold) = 0 . 50 P (low-grade gold) = 0 . 20 P (no gold) = 0 . 30 Using these probabilities, calculate the following: a. What is the probability of finding gold? - P (AU) = 0 . 50 + 0 . 20 = 0 . 70 After exploring the first site, soil samples are taken noting the concentration of gold in soil. The probabilities of finding a particular type of soil identified by the sample are: P (soil | high-grade gold) = 0 . 20 P (soil | low-grade gold) = 0 . 80 P (soil | no gold) = 0 . 20 b. How should the mining company interpret these samples? How would you revise probabilities (explain in plain English, step by step)? What is the new probability of finding gold? Would you make a bid to mine this particular site? - Let, S = soil test results; AU i = quality of gold, where: i = 1: represents high-grade and i = 2: represents low-grade, and i = 3: represents no gold: Events P ( AU i ) P ( S | AU i ) P ( AU i S ) P ( AU i | S ) High-grade ( AU 1 ) 0.50 0.20 0.10 0.3125 Low-grade ( AU 2 ) 0.20 0.80 0.16 0.5000 No gold ( AU 3 ) 0.30 0.20 0.06 0.1875 1.00 P(S) = 0.32 1 - The prior probabilities are P ( AU i ). - The quality of the soil depends and varies with the grade of gold found, which gives the conditional proba- bilities P ( S | AU i ). - The probability of finding a particular type of soil is given by P ( S ). This probability is found by looking at the probability of each experimental outcome, i.e., at the sum of the three joint probabilities. - Trying to identify the type of gold given a particular soil sample becomes looking for the posterior probabilities P ( AU i | S ) 2
- These posterior probabilities allow us to calculate that now that the firm has taken a soil test, the probability of finding gold in this side is now: P ( AU ) = 0 . 31 + 0 . 50 = 0 . 81 - So given these new soil samples, the probability of finding gold is not 81%, however it is more likely that the site contains low-grade gold rather than high-grade gold (0.50 vs. 0.31). Question 3. Interviews for Summer Position. Three students scheduled interviews for summer employment. In each case the interview results in either an offer for a position or no offer. Experimental outcomes are defined in terms of the results of the three interviews. a. List the experimental outcomes. Let, Y = position is offered. N = position is not offered. S = { ( Y, Y, Y ) , ( Y, Y, N ) , ( Y, N, Y ) , ( N, Y, Y ) , ( Y, N, N ) , ( N, Y, N ) , ( N, N, Y ) , ( N, N, N ) } b. Define a random variable that represents the number of offers made. Is the random variable discrete or continuous? - Let O = number of offers made; O is a discrete random variable. c. Show the value of the random variable for each of the experimental outcomes. Experimental Outcome (Y,Y,Y) (Y,Y,N) (Y,N,Y) (N,Y,Y) (Y,N,N) (N,Y,N) (N,N,Y) (N,N,N) Value of O 3 2 2 2 1 1 1 0 Question 4. TikTok Usage. It is estimated that nearly half of all TikTok users in 2020 were aged 10-29. Suppose a random sample of Canadians were surveyed regarding their use of TikTok with the following results: Age of Respondent Uses TikTok Does Not Use TikTok 10 - 19 36 44 20 - 29 87 52 30 - 39 42 89 40+ 44 138 a. Use the sample results above to calculate the probability that a respondent uses TikTok. P (respondent uses TikTok) = 209 / (209 + 323) = 0 . 3929 b. What is the probability of a respondent in each age range (determined by decade, i.e., 10-19, 20-29, etc.) uses TikTok? Which age group is most likely to use TikTok? P (respondent aged 10 – 19 uses TikTok) = 36 / (36 + 44) = 0 . 4500 P (respondent aged 20 – 29 uses TikTok) = 87 / (87 + 52) = 0 . 6259 P (respondent aged 30 – 39 uses TikTok) = 42 / (42 + 89) = 0 . 3206 P (respondent aged 40 + uses TikTok) = 44 / (44 + 138) = 0 . 2418 - Respondents aged 20 – 29 are most likely to use TikTok. Question 5. NBA Free Throws. Suppose that a particular NBA player makes 93% of free throws. Assume that late in a basketball game, this player is fouled and is awarded two shots. a. What is the probability that the player will make both shots? - P (Make the Shot) = 0 . 93 is the probability for each foul shot, so the probability that the player will make two consecutive foul shots is: P (Make the Shot) × P (Make the Shot) = (0 . 93)(0 . 93) = 0 . 8649 . 3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
b. What is the probability that the player will make at least one shot? - There are three ways in which the player can make at least one shot: make the first shot and miss the second shot; miss the first shot and make the second shot; make both shots. - Event “Miss the Shot” is the complement of the event “Make the Shot,” so: P (Miss the Shot) = 1 P (Make the Shot) = 1 0 . 93 = 0 . 07 So the probability of making at least one shot is 99.51%: P (Miss the Shot) P (Miss the Shot) = (0 . 93)(0 . 07) = 0 . 0651 P (Miss the Shot) P (Miss the Shot) = (0 . 07)(0 . 93) = 0 . 0651 P (Miss the Shot)(Miss the Shot) = (0 . 93)(0 . 93) = 0 . 864 0 . 0651 + 0 . 0651 + 0 . 864 = 0 . 9951 c. What is the probability that the player will miss both shots? - This can be found in two ways. Directly: P (Miss the Shot) P (Miss the Shot) = (0 . 07)(0 . 07) = 0 . 0049 - Or can recognize that the event “Miss both Shots” is the complement of the event “Make at Least One of the Two Shots” so: P (Miss the Shot) P (Miss the Shot) = 1 0 . 9951 = 0 . 0049 d. Late in a basketball game, a team often intentionally fouls an opposing player in order to stop the clock. The strategy tends to be intentionally fouling the other team’s worst free-throw shooter. Assume that the team’s worst free-throw shooter makes 58% of his free-throw shots. Calculate the probabilities for this player for parts (a), (b), and (c) again, assuming two shots. Is intentionally fouling this player (who makes 58%) of the free throws a better strategy than intentionally fouling the player who makes 93% of the free throws? - For the player who makes 58% of free throws: P (Make the Shot) = 0 . 58 for each foul shot. So probability that this player will make two consecutive foul shots is: P (Make the Shot) P (Make the Shot) = (0 . 58)(0 . 58) = 0 . 3364 . Again, there are ways to make at least one shot: the player can make the first shot and miss the second shot, miss the first shot and make the second shot, or make both shots. Because the event “Miss the Shot” is the complement of the event “Make the Shot”: P (Miss the Shot) = 1 P (Make the Shot) = 1 0 . 58 = 0 . 42 . Therefore: the probability of making at least one shot is 82.36%: P (Make the Shot) P (Miss the Shot) = (0 . 58)(0 . 42) = 0 . 2436 P (Miss the Shot) P (Make the Shot) = (0 . 42)(0 . 58) = 0 . 2436 P (Make the Shot) P (Make the Shot) = (0 . 58)(0 . 58) = 0 . 3364 - Can again find the probability the 58% free-throw shooter will miss both shots in two ways, directly: P (Miss the Shot) P (Miss the Shot) = (0 . 42)(0 . 42) = 0 . 1764 Or recognize that the event “Miss Both Shots” is the complement of the event “Make at Least One of the Two Shots,” so P (Miss the Shot) P (Miss the Shot) = 1 0 . 9951 = 0 . 1764 Therefore, intentionally fouling the 58% free-throw shooter is a better strategy than intentionally fouling the 93% shooter. 4
Question 6. Mortgage Defaults. A local bank reviewed their mortgage portfolio given the current interest rates. In the past, approximately 5% of households with mortgages defaulted. Management, thus, established a prior probability of 0.05 that any particular household will default on their mortgage. The bank also discovered that for customers who do not default, the probability of missing a monthly payment is 0.20. The probability of missing a monthly payment for households who default on their mortgage is 1. a. Assume a household missed one or more monthly payments, what is the probability they will default on their mortgage? Let, M = missed payment D 1 = customer defaults D 2 = customer does not default So, P ( D 1 ) = 0 . 05 P ( D 2 ) = 0 . 95 P ( M | D 2 ) = 0 . 20 P ( M | D 1 ) = 1 . 00 Therefore, P ( D 1 | M ) = P ( D 1 ) P ( M | D 1 ) P ( D 1 ) P ( M | D 1 ) + P ( D 2 ) P ( M | D 2 ) = (0 . 05)(1 . 00) (0 . 05)(1 . 00) + (0 . 95)(0 . 20) = 0 . 21 b. The bank is considering changing the policies calling the loans and demanding full payment if the probability that a household defaults is greater than 0.20. If the policy changes, would the bank recall mortgages of households that miss a monthly payment? Why or why not? - Yes, the probability of default is greater than 0.20. Question 7. Random Variables. List 5 experiments (that you experience on your daily life) and associate them to a random variable. In each example, identify the values that the random variable can assume. Identify if the random variable is discrete or continuous and ensure you include at least two continuous variables. Example: Experiment: completing this 10-question assignment. Random Variable (x): number of questions an- swered correctly. Values: 0, 1, 2, ..., 10 and it is a discrete random variable. Question 8. Operating Room Use. The following data were collected by the number of operating rooms in the Royal Jubilee Hospital over a 20-day period. On three of the days, only one operating room was used, on five of the days two rooms were used, on eight of the days three rooms were used, and on four days all four of the hospital’s operating rooms were used. a. Use the relative frequency approach to construct an empirical discrete probability distribution for the number of operating rooms in use on any given day. x f(x) 1 3/20 = 0.15 2 5/20 = 0.25 3 8/20 = 0.40 4 4/20 = 0.20 Total 1.00 b. Draw a graph of the probability distribution. 5
c. Show that your probability distribution satisfies the required conditions for a valid discrete probability dis- tribution. f ( x ) 0 for x = 1 , 2 , 3 , 4 X f ( x ) = 1 Question 9. Coffee Consumption. In Gallup’s Annual Consumption Habits Poll, 1014 adults aged 18 and over were sampled. One of the questions was: “How many cups of coffee, if any, do you drink on an average day?” From this sample, 365 adults noted they do not drink coffee; 264 adults have one cup of coffee per day; 193 adults drink two cups per day; 91 adults drink three cups of coffee per day and 101 adults drink 4 or more cups of coffee per day. Define a random variable, x , as the number of cups of coffee consumed on an average day. Let, x = 4 represent four or more cups of coffee. a. Develop a probability distribution for x . The number of responses is 1014. Therefore: f (0) = 365 / 1014 = 0 . 3600 f (1) = 264 / 1014 = 0 . 2604 f (2) = 193 / 1014 = 0 . 1903 f (3) = 91 / 1014 = 0 . 0897 f (4) = 101 / 1014 = 0 . 0996 b. Compute the expected value of x . - E ( x ) = 1.3085. Calculations below. c. Compute the variance of x . - V ar ( x ) = 1.7103. Calculations below. x f ( x ) x · f ( x ) x µ ( x µ ) 2 ( x µ ) 2 · f ( x ) 0 0.3600 0.0000 -1.3085 1.7122 0.6164 1 0.2604 0.2604 -0.3085 0.0952 0.0248 2 0.1903 0.3806 0.6915 0.4782 0.0910 3 0.0897 0.2691 1.6915 2.8612 0.2566 4 0.0996 0.3984 2.6915 7.2442 0.7215 E ( x ) = 1.3085 V ar ( x ) = 1.7103 d. Suppose that you are only interested on people who consume coffee on a daily basis. For this subset, let, y be the number of cups of coffee consumed on an average day. Compute the expected value of y and compate it to the expected value of x . - The possible values of y are 1, 2, 3, and 4. The total number of responses is 649, so: y f ( y ) y · f ( y ) 1 0.4068 0.4068 2 0.2974 0.5948 3 0.1402 0.4206 4 0.1556 0.6225 Total 1.0000 2.0447 6
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
- E ( y ) = 2 . 0447. The expected value or mean number of cups per day for adults that drink at least one cup of coffee on an average day is 2.05, or approximately a mean of two cups per day. As expected, the mean is somewhat higher when we only take into account adults who drink at least one cup of coffee per day. Question 10. Random Variable. The following table provides a probability distribution of random variable, x . x f(x) 3 0.25 6 0.50 9 0.25 For the following, show all steps and explain, with words, what is being calculated in each step. a. Compute the expected value of x . - The expected value is the sum of the values of the random variable using the probabilities of each (given by f ( x )) as weights. E ( x ) = 6 b. Compute the variance of x . - The variance is calculated by computing the squared differences between each value of the random variable and its expected value (mean) and averaging these squared differences, again, using the probabilities of each value as weights. V ar ( x ) = 4 . 5 c. Compute the standard deviation of x . - The standard deviation is simply the square root of the variance. Since we use squared deviations when computing the variance, the units for it are squared, the standard deviation allows us to have the same units as the variable. σ ( x ) = 2 . 12 - Calculations: x f ( x ) x · f ( x ) x µ ( x µ ) 2 ( x µ ) 2 · f ( x ) 3 0.25 0.75 -3 9 2.25 6 0.50 3 0 0 0 9 0.25 2.25 3 9 2.25 1 .00 µ = 6 σ 2 ( x ) = 4.5 σ ( x ) = 2 . 12 7