Suppose you want to find out how many people support Policy X. A standard polling approach is to just ask N many people whether or not they support Policy X, and take the fraction of people who say yes as an estimate of the probability that any one person supports the policy. Suppose that the probability someone supports the policy is p, which you do not know. Let py be the number of people polled who supported the policy, divided by the total number of people polled N. 1) What distribution of N * PN? 2) Show that the expected value of pN is p. 3) If I want my estimate to be accurate, I want the error of pN to be small. How many people should I poll to guarantee the expected squared error on py is less than e? 4) How many people should I poll to guarantee the expected squared error on py is less than e, if I don't know p? 5) Just because the expected error is small doesn't mean the actual error is small. How many people should I poll to guarantee that the actual error on py is less than with 90% confidence? 6) How many people should I poll to guarantee the actual error on py is less than with 90% confidence, if I don't know p?

Glencoe Algebra 1, Student Edition, 9780079039897, 0079039898, 2018
18th Edition
ISBN:9780079039897
Author:Carter
Publisher:Carter
Chapter10: Statistics
Section10.6: Summarizing Categorical Data
Problem 42PFA
icon
Related questions
Question

part 5 6 7

Suppose you want to find out how many people support Policy X. A standard polling approach is to just ask N
many people whether or not they support Policy X, and take the fraction of people who say yes as an estimate of
the probability that any one person supports the policy. Suppose that the probability someone supports the policy
is p, which you do not know. Let py be the mumber of people polled who supported the policy, divided by the total
number of people polled N.
1) What distribution of N * PN?
2) Show that the expected value of pN is p.
3) If I want my estimate to be accurate, I want the error of pn to be small. How many people should I poll to
guarantee the erpected squared error on py is less than e?
4) How many people should I poll to guarantee the expected squared error on pn is less than e, if I don't know
p?
5) Just because the expected error is small doesn't mean the actual error is small. How many people should I poll
to guarantee that the actual error on pN is less than e with 90% confidence?
6) How many people should I poll to guarantee the actual error on pN is less than e with 90% confidence, if I
don't know p?
Transcribed Image Text:Suppose you want to find out how many people support Policy X. A standard polling approach is to just ask N many people whether or not they support Policy X, and take the fraction of people who say yes as an estimate of the probability that any one person supports the policy. Suppose that the probability someone supports the policy is p, which you do not know. Let py be the mumber of people polled who supported the policy, divided by the total number of people polled N. 1) What distribution of N * PN? 2) Show that the expected value of pN is p. 3) If I want my estimate to be accurate, I want the error of pn to be small. How many people should I poll to guarantee the erpected squared error on py is less than e? 4) How many people should I poll to guarantee the expected squared error on pn is less than e, if I don't know p? 5) Just because the expected error is small doesn't mean the actual error is small. How many people should I poll to guarantee that the actual error on pN is less than e with 90% confidence? 6) How many people should I poll to guarantee the actual error on pN is less than e with 90% confidence, if I don't know p?
A potential solution to this is the following: suppose that 'YES' is the embarrassing or socially shameful answer; give
the people you are polling the following instructions: flip a coin privately, and if it comes up heads answer honestly,
but if it comes up tails answer 'YES' regardless of what the truth is. This gives people plausible deniability about
why they answered yes, if pressed. Again, let py be the fraction of people who said 'YES'. Let p be the probability
that a randomly selected person says 'yes'; let q be the probability that a person's true answer is 'yes'. Note again,
we have that the expected value of pN is p, but we are actually interested in measuring q.
7) What is the relationship between q and p?
8) Construct an estimator ĝy from pN so that the expected value of ĝy is q.
9) If I want my estimate to be accurate, I want the error on ĝn to be small. How many people should I poll to
guarantee that the expected squared error on qn is less than e, when I don't know the value of q? How does
this compare to the answer in the previous section?
10) How many people should I poll to guarantee the actual error on ĝn is less than e, with 90% confidence, if I
don't know q?
11) What is the additional 'cost' of accurate polling if I want to preserve people's privacy in this way?
Transcribed Image Text:A potential solution to this is the following: suppose that 'YES' is the embarrassing or socially shameful answer; give the people you are polling the following instructions: flip a coin privately, and if it comes up heads answer honestly, but if it comes up tails answer 'YES' regardless of what the truth is. This gives people plausible deniability about why they answered yes, if pressed. Again, let py be the fraction of people who said 'YES'. Let p be the probability that a randomly selected person says 'yes'; let q be the probability that a person's true answer is 'yes'. Note again, we have that the expected value of pN is p, but we are actually interested in measuring q. 7) What is the relationship between q and p? 8) Construct an estimator ĝy from pN so that the expected value of ĝy is q. 9) If I want my estimate to be accurate, I want the error on ĝn to be small. How many people should I poll to guarantee that the expected squared error on qn is less than e, when I don't know the value of q? How does this compare to the answer in the previous section? 10) How many people should I poll to guarantee the actual error on ĝn is less than e, with 90% confidence, if I don't know q? 11) What is the additional 'cost' of accurate polling if I want to preserve people's privacy in this way?
Expert Solution
steps

Step by step

Solved in 3 steps with 20 images

Blurred answer
Recommended textbooks for you
Glencoe Algebra 1, Student Edition, 9780079039897…
Glencoe Algebra 1, Student Edition, 9780079039897…
Algebra
ISBN:
9780079039897
Author:
Carter
Publisher:
McGraw Hill
Holt Mcdougal Larson Pre-algebra: Student Edition…
Holt Mcdougal Larson Pre-algebra: Student Edition…
Algebra
ISBN:
9780547587776
Author:
HOLT MCDOUGAL
Publisher:
HOLT MCDOUGAL
Calculus For The Life Sciences
Calculus For The Life Sciences
Calculus
ISBN:
9780321964038
Author:
GREENWELL, Raymond N., RITCHEY, Nathan P., Lial, Margaret L.
Publisher:
Pearson Addison Wesley,
Big Ideas Math A Bridge To Success Algebra 1: Stu…
Big Ideas Math A Bridge To Success Algebra 1: Stu…
Algebra
ISBN:
9781680331141
Author:
HOUGHTON MIFFLIN HARCOURT
Publisher:
Houghton Mifflin Harcourt