IDS 575 PS3
pdf
keyboard_arrow_up
School
University of Illinois, Chicago *
*We aren’t endorsed by this school
Course
575
Subject
Statistics
Date
Apr 3, 2024
Type
Pages
12
Uploaded by ChancellorFogGull33
Graded
Problem Set (PS) #03
Student
Ayushi Rajive Srivastava
Total Points
100 / 100 pts
Question 1
Maximum Likelihood Basic
25
/ 25 pts
1.1
(no title)
5
/ 5 pts
+ 0 pts
Incorrect
+ 5 pts
Correct
1.2
(no title)
8
/ 8 pts
+ 8 pts
Correct
+ 0 pts
Incorrect
1.3
(no title)
7
/ 7 pts
+ 7 pts
Correct
+ 0 pts
Incorrect
1.4
(no title)
5
/ 5 pts
+ 5 pts
Correct
+ 0 pts
Incorrect
Question 2
Maximum Likelihood Estimation
30
/ 30 pts
2.1
(no title)
10
/ 10 pts
+ 10 pts
Correct
+ 0 pts
Incorrect
+ 8 pts
partial mistake
2.2
(no title)
10
/ 10 pts
+ 10 pts
Correct
+ 0 pts
Incorrect
+ 5 pts
half correct
− 1 pt
Click here to replace this description.
+ 8 pts
Click here to replace this description.
2.3
(no title)
10
/ 10 pts
+ 10 pts
Correct
+ 0 pts
Incorrect
+ 5 pts
half correct
− 1 pt
minor mistake
Question 3
Logistic Regression
45
/ 45 pts
3.1
(no title)
7
/ 7 pts
+ 7 pts
Correct
+ 0 pts
Incorrect
3.2
(no title)
7
/ 7 pts
+ 7 pts
Correct
+ 0 pts
Incorrect
+ 1.5 pts
half
3.3
(no title)
5
/ 5 pts
+ 5 pts
Correct
+ 0 pts
Incorrect
3.4
(no title)
6
/ 6 pts
+ 6 pts
Correct
+ 0 pts
Incorrect
3.5
(no title)
10
/ 10 pts
+ 10 pts
Correct
+ 0 pts
Incorrect
3.6
(no title)
10
/ 10 pts
+ 10 pts
Correct
+ 0 pts
Incorrect
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Q1 Maximum Likelihood Basic
25 Points
Q1.1
5 Points
A coin is tossed 100 times and lands heads 82 times. Choose every correct option in the following.
(select all applied, no partial credits for more or less)
Q1.2
8 Points
In the setting given at Q1.1, what is the probability of head that makes your overall observation most-likely? (autograded short answer: only the final result number, round to 2 decimal places,like 0.55)
0.82
This is a fair coin.
Probability of the overall observation is .
0.5
82
Maximum possible likelihood of the overall observation is .
100
82
Minimum possible likelihood of the overall observation is .
100
18
None of the above
Q1.3
7 Points
Assume that the coin used for Q1.1 turns out to be completely normal, which is supposed to generate almost equal numbers of heads and tails if randomly tossing many times. Choose every possible concern of using Maximum Likelihood Estimation. (select all applied, no partial credits for more or less)
Q1.4
5 Points
Maximum Likelihood Estimation gives us a distribution over the parameters as well as the best parameter itself. In other words, MLE provides not only the best parameter but also other parameters with the associated uncertainty of being "non-best".
Does not fully use our observation.
Does not incorporate our prior knowledge.
Does fit tightly to the given observation.
Does fit loosely to the given observation.
None of the above.
θ
^
θ
p
(
θ
)
True
False
Q2 Maximum Likelihood Estimation
30 Points
Consider the following density function , ; , . This is a legal probability density (parametrized by ) because one can verify that the integral over is equal to 1. If necessary, mean and variance of this distributions can be verifed as and , respectively.
f
(
x
∣
θ
) =
θ xe
2
−
θx
x
≥ 0
f
(
x
∣
θ
) = 0
x
< 0
f
θ
∈
R
x
∈ [−∞,∞]
2/
θ
2/
θ
2
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Q2.1
10 Points
Derive a likelihood of a dataset that consists of independent samples from this distribution. (Hint: The likelihood must be a function of the parameters )
MLE Q2.1.pdf
Download
Your browser does not support PDF previews. You can download the file instead.
D
= {
x
,
x
,...,
x
}
(1)
(2)
(
m
)
m
L
(
θ
;
D
)
θ
Q2.2
10 Points
Derive the log-likelihood function of the dataset from Q2.1. (Hint: The log-
likelihood )
MLE Q2.2.pdf
Download
Your browser does not support PDF previews. You can download the file instead.
D
l
(
θ
;
D
) = log
L
(
θ
;
D
)
Q2.3
10 Points
Find the Maximum Likelihood Estimator . (Hint: Set the derivative equal to zero, and then solve the formula)
MLE Q2.3.pdf
Download
Your browser does not support PDF previews. You can download the file instead.
θ
^
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Q3 Logistic Regression
45 Points
Q3.1
7 Points
Suppose that you have trained a logistic regression classifier, and it outputs a prediction on a new example . Choose every correct interpretation in the following. (select all applied, no partial credits for more or less)
Q3.2
7 Points
Which of the following ways can we train a logistic regression model? (select all correct, no partial credits for more or less)
h
(
x
) =
θ
0.2
x
Our estimate for is 0.2.
P
(
y
= 0∣
x
;
θ
)
Our estimate for is 0.2.
P
(
y
= 1∣
x
;
θ
)
Our estimate for is 0.8.
P
(
y
= 0∣
x
;
θ
)
Our estimate for is 0.8.
P
(
y
= 1∣
x
;
θ
)
Minimize least-square error
Maximize likelihood
Solve normal equation
Minimimize negative log-likelihood
Q3.3
5 Points
In logistic regression, what do we estimate for one each unit’s change in ?
Q3.4
6 Points
Choose every option that correctly describes properties of the logistic function. (select all applied, no partial credits for more or less)
X
The change in multiplied with .
Y
Y
The change in from its mean.
Y
How much changes.
Y
How much the natural logarithm of the odds (i.e., ) changes.
log
p
(
y
=0)
p
(
y
=1)
It maps a real-valued confidence value into a probability value.
It is essentially an identity function between .
[0,1]
It always maps zero confidence exactly into the probability 0.5.
It is a continuous function that is differentiable everywhere.
Its derivative can be easily evaluated with itself.
Q3.5
10 Points
Recall the grading problem to predict pass/fail in the class. Suppose you collect data for a group of students in the class that consist of two input features = hours studied and = undergrad GPA. Your goal is to predict the output {pass, fail}. Suppose that you fit a logistic regression, learning its parameter .
What will be the probability for a student who studies for 40 hours and has a GPA of 3.5 to pass the class?
(augraded short answer: only final result number, round to 2 decimal places, like 0.11)
0.38
Q3.6
10 Points
How many hours would the student in part Q3.5 need to study in order to have at least 50% chance of passing the class?
(autograded short answer: only final result number: an integer)
50
X
1
X
2
Y
∈
(
θ
;
θ
;
θ
) =
0
1
2
(−6;0.05;1)
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Related Documents
Related Questions
null/observed/alternative20/25/50%compatible/not compatibleinside/outside30/50/95is/is no
https://learning.statistics-is-awesome.org/threethings/
arrow_forward
Hello! Step by step can you show me how to compute the valus for this statistics test please. Also, how can I compute this into my calculator?
High-powered experimental engines are being developed by the Hendrix Motor Company for use in their new sports coupe. The engineers have calculated the maximum horsepower for the engines to be 610HP. Sixteen engines are randomly selected for testing. Perform a hypothesis test to determine whether the data suggests that the average maximum horsepower for the experimental engine is significantly different than the maximum horsepower calculated by the engineers. Assume the data are normally distributed and use a significance level 0.10.
Maximum Horsepower (HP)
618
590
641
617
592
568
575
654
645
652
651
638
659
633
657
647
arrow_forward
Part d please
Comparing Length of Flight Delays
The success of an airline depends heavily on its ability to provide a pleasant customer experience. One dimension of customer service on which airlines compete is on-time arrival. The file LateFlights contains a sample of data from delayed flights showing the number of minutes each delayed flight was late for two different airlines, Delta and Southwest.
a. Formulate the hypotheses that can be used to test for a difference between the population mean minutes late for delayed flights by these two airlines.
b. What is the sample mean number of minutes late for delayed flights for each of these two airlines?
Delta sample mean: 50.6 minutes
Southwest sample mean: 52.8 minutes
c. Using a .05 level of significance, what is the p-value and what is your conclusion?
The p value is 0.7535
P value > alpha
We fail to reject the null hypothesis
d. Estimate a 95% interval estimation for the difference between the population mean minutes late
e.…
arrow_forward
Please can someone take a look at the questions not sure if the answers are right.
arrow_forward
[3] Instruction: Draw a conclusion by analyzing the given sets of data using the z test. Use the 7 steps below in answering a word problem in inferential statistics. (Type the answers with solution, NOT HANDWRITTEN + Please put a label with "step #" on every solution).
Note (1): pls refer for the format in the photo below and few example on how to answer each steps.
Note (2): pls refer for the problem on the next photo
arrow_forward
IQ and Lead Exposure Data Set 7 “IQ and Lead” in Appendix B lists full IQ scores for a random sample of subjects with “medium” lead levels in their blood and another random sample of subjects with “high” lead levels in their blood. Use a 0.05 significance level to test the claim that subjects with medium lead levels have a higher median of the full IQ scores than subjects with high lead levels. Does lead level appear to affect full IQ scores?
arrow_forward
Nonparametric Tests
a. Which of the following terms is sometimes used instead of “nonparametric test normality test; abnormality test; distribution-free test; last testament; test of patience?
b. Why is the term that is the answer to part (a) better than “nonparametric test”?
arrow_forward
(b) Compute the value of the test statistic. Round the answer to at least three decimal places.
arrow_forward
Please show how it is solve
arrow_forward
Look at the first picture than answer questions in the 2nd one
arrow_forward
Question Help
Two different simple random samples are drawn from two different populations. The first sample consists of 20 people with 9 having a common attribute. The second sample consists of 2000 people with 1414 of them having the same common
attribute. Compare the results from a hypothesis test of p, = p2 (with a 0.05 significance level) and a 95% confidence interval estimate of p, - p2.
TP1 P2
P2
H1: P1 = P2
H1: P1 #P2
H1: P1 > P2
O F. Ho: P1 S P2
H1:P, #P2
OD. Ho: P12 P2
O E. Ho: P1 = P2
H: P1 #P2
Hq:P1
arrow_forward
Risk A study of auto safety determined the number ofdriver deaths per million vehicle sales, classified by typeof vehicle. The data on the next page are for 6 midsizemodels and 6 SUVs. Wondering if there is evidence thatdrivers of SUVs are safer, we hope to create a 95%confidence interval for the difference in driver death ratesfor the two types of vehicles. Are these data appropriatefor this inference? Explain. (Ross and Wenzel, An Analysisof Traffic Deaths by Vehicle Type and Model, March 2002)
Midsize 47 54 64 76 88 97SUV 55 60 62 76 91 109
arrow_forward
What is the equation for computing the degrees of freedom (df) for a hypothesis test of independence of categorical data?
O df = (number of rows + 1) (number of columns + 1)
df = (number of rows) (number of columns)
df = (number of rows - 1) (number of columns - 1)
O df = n - 1
arrow_forward
Explain what this chart is showing.
arrow_forward
"Bob didn’t wear his lucky T-shirt to class, so he failed his chemistry exam." This best illustrates which fallacy?
Multiple Choice
Small sample generalization
Poor survey methods
Post hoc reasoning
More than one of the above
arrow_forward
Number of Jobs A sociologist found that in a sample of 50 retired men, the average number of jobs they had during their lifetimes was 6.9. The population
standard deviation is 2.3.
Part: 0/ 4
Part 1 of 4
(a) Find the best point estimate of the mean.
The best point estimate of the mean is
Part: 1/4
Part 2 of 4
(b) Find the 90% confidence interval of the mean number of jobs. Round intermediate and final answers to one decimal place.
arrow_forward
Question 3
Express the confidence interval (18.9 %, 30.1 %) in the form of p + E.
% ±
Submit Question
18
arrow_forward
Please help with this question the previous anwere was wrong. It’s statistics.
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you

Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill

Related Questions
- null/observed/alternative20/25/50%compatible/not compatibleinside/outside30/50/95is/is no https://learning.statistics-is-awesome.org/threethings/arrow_forwardHello! Step by step can you show me how to compute the valus for this statistics test please. Also, how can I compute this into my calculator? High-powered experimental engines are being developed by the Hendrix Motor Company for use in their new sports coupe. The engineers have calculated the maximum horsepower for the engines to be 610HP. Sixteen engines are randomly selected for testing. Perform a hypothesis test to determine whether the data suggests that the average maximum horsepower for the experimental engine is significantly different than the maximum horsepower calculated by the engineers. Assume the data are normally distributed and use a significance level 0.10. Maximum Horsepower (HP) 618 590 641 617 592 568 575 654 645 652 651 638 659 633 657 647arrow_forwardPart d please Comparing Length of Flight Delays The success of an airline depends heavily on its ability to provide a pleasant customer experience. One dimension of customer service on which airlines compete is on-time arrival. The file LateFlights contains a sample of data from delayed flights showing the number of minutes each delayed flight was late for two different airlines, Delta and Southwest. a. Formulate the hypotheses that can be used to test for a difference between the population mean minutes late for delayed flights by these two airlines. b. What is the sample mean number of minutes late for delayed flights for each of these two airlines? Delta sample mean: 50.6 minutes Southwest sample mean: 52.8 minutes c. Using a .05 level of significance, what is the p-value and what is your conclusion? The p value is 0.7535 P value > alpha We fail to reject the null hypothesis d. Estimate a 95% interval estimation for the difference between the population mean minutes late e.…arrow_forward
- Please can someone take a look at the questions not sure if the answers are right.arrow_forward[3] Instruction: Draw a conclusion by analyzing the given sets of data using the z test. Use the 7 steps below in answering a word problem in inferential statistics. (Type the answers with solution, NOT HANDWRITTEN + Please put a label with "step #" on every solution). Note (1): pls refer for the format in the photo below and few example on how to answer each steps. Note (2): pls refer for the problem on the next photoarrow_forwardIQ and Lead Exposure Data Set 7 “IQ and Lead” in Appendix B lists full IQ scores for a random sample of subjects with “medium” lead levels in their blood and another random sample of subjects with “high” lead levels in their blood. Use a 0.05 significance level to test the claim that subjects with medium lead levels have a higher median of the full IQ scores than subjects with high lead levels. Does lead level appear to affect full IQ scores?arrow_forward
- Nonparametric Tests a. Which of the following terms is sometimes used instead of “nonparametric test normality test; abnormality test; distribution-free test; last testament; test of patience? b. Why is the term that is the answer to part (a) better than “nonparametric test”?arrow_forward(b) Compute the value of the test statistic. Round the answer to at least three decimal places.arrow_forwardPlease show how it is solvearrow_forward
- Look at the first picture than answer questions in the 2nd onearrow_forwardQuestion Help Two different simple random samples are drawn from two different populations. The first sample consists of 20 people with 9 having a common attribute. The second sample consists of 2000 people with 1414 of them having the same common attribute. Compare the results from a hypothesis test of p, = p2 (with a 0.05 significance level) and a 95% confidence interval estimate of p, - p2. TP1 P2 P2 H1: P1 = P2 H1: P1 #P2 H1: P1 > P2 O F. Ho: P1 S P2 H1:P, #P2 OD. Ho: P12 P2 O E. Ho: P1 = P2 H: P1 #P2 Hq:P1arrow_forwardRisk A study of auto safety determined the number ofdriver deaths per million vehicle sales, classified by typeof vehicle. The data on the next page are for 6 midsizemodels and 6 SUVs. Wondering if there is evidence thatdrivers of SUVs are safer, we hope to create a 95%confidence interval for the difference in driver death ratesfor the two types of vehicles. Are these data appropriatefor this inference? Explain. (Ross and Wenzel, An Analysisof Traffic Deaths by Vehicle Type and Model, March 2002) Midsize 47 54 64 76 88 97SUV 55 60 62 76 91 109arrow_forwardarrow_back_iosSEE MORE QUESTIONSarrow_forward_ios
Recommended textbooks for you
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw Hill

Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
