In: Statistics and Probability
Suppose you have N items with values xi from which we want to sample n items with replacement. Each item has its own probability pi of being selected across all times we pick items.
What's the estimated variance of the Horvitz-Thompson estimator of T?
In this sample, we are picking the n items with replacement.
Let be the probability that the ith item is selected atleast once. So, the probability that it is selected in terms of pi is:
Let be the probability that both ith and jth item are included. Since:
We have:
P(ith and jth item included) = P(ith included) + P(jth item included) - (1 - P(ith and jth item excluded))
Substituting the formulae:
The estimated variance of the Horvitz-Thompson estimator of T is given by: