Problem 1: Asking Embarrassing Questions, Politely When doing polling, for instance to figure out how popular a given candidate is, a common trick is to just ask N many people whether they support that candidate, and take the support to be the faction of people who say yes: if 70 people support the candidate out of 100 asked, we estimate the support at 70% or 0.7. Suppose that the probability a person supports a candidate is p, which you do not know. Let pn be the fraction of N people polled who support the candidate: total supporters divided by N people polled. 1) What is the distribution of N *pn? 2) Show hat the expected value of pN is p, i.e., PN is a valid estimator for p. If you want your estimated value of p to be accurate, you want your 'error' on N to be small. 3) How many people N should you poll to guarantee the expected squared error on N is less than 4) How many people N should you poll to guarantee the expected squared error on py is less than €, even if you don't know p?

A First Course in Probability (10th Edition)
10th Edition
ISBN:9780134753119
Author:Sheldon Ross
Publisher:Sheldon Ross
Chapter1: Combinatorial Analysis
Section: Chapter Questions
Problem 1.1P: a. How many different 7-place license plates are possible if the first 2 places are for letters and...
icon
Related questions
Question
please send handwritten solution Q4
Problem 1: Asking Embarrassing Questions, Politely
When doing polling, for instace to figure out how popular a given candidate is, a common trick is to just ask N
many people whether they support that candidate, and take the support to be the faction of people who say yes: if 70
people support the candidate out of 100 asked, we estimate the support at 70% or 0.7. Suppose that the probability
a person supports a candidate is p, which you do not know. Let PN be the fraction of N people polled who support
the candidate: total supporters divided by N people polled.
1) What is the distribution of N * p,?
2) Show hat the expected value of pN is p, i.e., pN is a valid estimator for p.
If you want your estimated value of p to be accurate, you want your 'error' on pN to be small.
3) How many people N should you poll to guarantee the erpected squared error on pN is less than e?
4) How many people N should you poll to guarantee the expected squared error on py is less than e, even if you
don't know p?
In the previous to problems, we considered the average or expected squared error. But just because the expected
error is small doesn't mean the actual error is small.
5) How many people N should you poll to guarantee the actual error on PN is less than e with 95% confidence,
even if you don't know p?
However, a problem with polling is whether or not people are willing to answer honestly. If a question might be
viewed as shameful or embarrassing (about politics, sexual activity, or whatever people are sensitive about), they
may be reluctant to answer honestly.
A potential solution to this is the following: when asking whether they support or do not support a given candidate,
give the people you are polling the following instructions: flip a fair coin privately, and if it comes up HEADS, answer
honestly; if it comes up TAILS, flip another fair coin and if it comes up HEADS, answer 'support', if it comes up
TAILS, answer do not 'support'. In this case, the person being polled can always claim that whatever they answered
was the result of the coin in a sense, the results are anonymized and the people being polled are protected.
Let p be the probability that a randomly polled person using this method says 'support'; let q be the true probability
a random person actually supports the candidate. We would like to know the value of q, but we can only estimate
the value of p: let p, be the fraction of N people who answer 'support' using this method. We have that ElPN] = p,
as before.
7) What is the relationship between q and p?
8) Construct an estimator qn from PN (i.e., a formula for ĝn in terms of pN) so that the expected value ElâN] = q.
What is the variance of ĝn?
9) How many people N should you poll to guarantee the actual error between qn and q is less than e, with 90%
confidence? Note, q is unknown, so you cannot use it to determine N.
Transcribed Image Text:Problem 1: Asking Embarrassing Questions, Politely When doing polling, for instace to figure out how popular a given candidate is, a common trick is to just ask N many people whether they support that candidate, and take the support to be the faction of people who say yes: if 70 people support the candidate out of 100 asked, we estimate the support at 70% or 0.7. Suppose that the probability a person supports a candidate is p, which you do not know. Let PN be the fraction of N people polled who support the candidate: total supporters divided by N people polled. 1) What is the distribution of N * p,? 2) Show hat the expected value of pN is p, i.e., pN is a valid estimator for p. If you want your estimated value of p to be accurate, you want your 'error' on pN to be small. 3) How many people N should you poll to guarantee the erpected squared error on pN is less than e? 4) How many people N should you poll to guarantee the expected squared error on py is less than e, even if you don't know p? In the previous to problems, we considered the average or expected squared error. But just because the expected error is small doesn't mean the actual error is small. 5) How many people N should you poll to guarantee the actual error on PN is less than e with 95% confidence, even if you don't know p? However, a problem with polling is whether or not people are willing to answer honestly. If a question might be viewed as shameful or embarrassing (about politics, sexual activity, or whatever people are sensitive about), they may be reluctant to answer honestly. A potential solution to this is the following: when asking whether they support or do not support a given candidate, give the people you are polling the following instructions: flip a fair coin privately, and if it comes up HEADS, answer honestly; if it comes up TAILS, flip another fair coin and if it comes up HEADS, answer 'support', if it comes up TAILS, answer do not 'support'. In this case, the person being polled can always claim that whatever they answered was the result of the coin in a sense, the results are anonymized and the people being polled are protected. Let p be the probability that a randomly polled person using this method says 'support'; let q be the true probability a random person actually supports the candidate. We would like to know the value of q, but we can only estimate the value of p: let p, be the fraction of N people who answer 'support' using this method. We have that ElPN] = p, as before. 7) What is the relationship between q and p? 8) Construct an estimator qn from PN (i.e., a formula for ĝn in terms of pN) so that the expected value ElâN] = q. What is the variance of ĝn? 9) How many people N should you poll to guarantee the actual error between qn and q is less than e, with 90% confidence? Note, q is unknown, so you cannot use it to determine N.
Expert Solution
steps

Step by step

Solved in 3 steps with 14 images

Blurred answer