Question
I need the answer to question number 6.
[Table: outcome (H or T) of the 1st through 7th toss in each experiment; the individual cell values did not survive extraction.]

Machine Learning is the science of learning from experience. Suppose Alice is repeatedly doing an
experiment. In each experiment she tosses $n$ coins. She does this experiment $m$ times. In the first
round, $x_1$ coins yielded a head and $y_1$ coins yielded a tail. Notice that $x_1 + y_1 = n$. In the second
round, $x_2$ coins yielded a head and $y_2$ coins yielded a tail. Once again, $x_2 + y_2 = n$. She does this
experiment $m$ times. Your job is to estimate the probability $p$ of a coin yielding a head.
1. What is your guess on the value of $p$?
2. In Maximum Likelihood Estimation, we want to find a parameter $p$ which maximizes the probability of all the
observations in the dataset. If the dataset is a matrix $A$, where the rows $a_1, a_2, \dots, a_m$ are
individual observations, we want to maximize $P(A) = P(a_1) P(a_2) \cdots P(a_m)$ because individual
experiments are independent. Maximizing this is equivalent to maximizing
$\log P(A) = \log P(a_1) + \log P(a_2) + \cdots + \log P(a_m)$. Maximizing this quantity is equivalent to minimizing
$-\log P(A) = -\log P(a_1) - \log P(a_2) - \cdots - \log P(a_m)$.
3. Here you need to find out $P(a_i)$ for yourself.
4. If you can do that properly, you will find an equation of the form:
$$-\frac{\log P(A)}{mn} = -\frac{\sum_{i=1}^{m} x_i}{mn} \log p - \frac{\sum_{i=1}^{m} y_i}{mn} \log (1 - p)$$
Now, define $q = \frac{\sum_{i=1}^{m} x_i}{mn}$. Then the equation becomes:
$$-\frac{\log P(A)}{mn} = -q \log p - (1 - q) \log (1 - p)$$
Use Pinsker's Inequality or Calculus to show that $p = q$.
5. What is the value of p for the above dataset given in the table?
6. If you toss 20 coins now, how many coins are most likely to yield a head?
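
For questions 5 and 6, a minimal Python sketch may help. Setting the derivative of -q log p - (1 - q) log(1 - p) with respect to p to zero gives -q/p + (1 - q)/(1 - p) = 0, that is p = q, so the estimate is the fraction of heads among all mn tosses. The rows below are made-up placeholder data, not the table from the question, and the function names `estimate_p`, `neg_log_likelihood`, and `most_likely_heads` are illustrative assumptions.

```python
from math import comb, log

def estimate_p(rows):
    """Maximum likelihood estimate of p: total heads / total tosses (q in the text)."""
    heads = sum(row.count("H") for row in rows)
    tosses = sum(len(row) for row in rows)
    return heads / tosses

def neg_log_likelihood(p, q):
    """Normalized negative log-likelihood: -q log p - (1 - q) log(1 - p)."""
    return -q * log(p) - (1 - q) * log(1 - p)

def most_likely_heads(n, p):
    """Head count k in 0..n with the largest Binomial(n, p) probability."""
    pmf = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    return max(range(n + 1), key=lambda k: pmf[k])

# Hypothetical data (assumption): four experiments of seven tosses each.
# Replace these rows with the H/T rows of the actual table.
rows = ["HTTHTTT", "TTHTTTH", "THHTTHT", "HTHTTHT"]

q = estimate_p(rows)

# Numerical check that the normalized negative log-likelihood is smallest near p = q.
grid = [i / 1000 for i in range(1, 1000)]
best_p = min(grid, key=lambda p: neg_log_likelihood(p, q))

print("estimated p:", q)
print("grid minimizer of the negative log-likelihood:", best_p)
print("most likely number of heads in 20 tosses:", most_likely_heads(20, q))
```

With the real table, the answer to question 6 is the k that maximizes the Binomial(20, p) probability. That k equals floor(21p) when 21p is not an integer; when 21p is an integer, k = 21p - 1 and k = 21p are equally likely.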