In this paper, a new randomized response technique aimed at protecting respondents' privacy is proposed. It is designed for estimating the population total, or the population mean, of a quantitative characteristic. It provides a~high degree of protection to the interviewed individuals, hence it may be favorably perceived by them and increase their willingness to cooperate. Instead of revealing the true value of the characteristic under investigation, a respondent only states whether the value is greater (or smaller) than a~number which is selected by him/her at random, and is unknown to the interviewer. For each respondent this number, a sort of individual threshold, is generated as a pseudorandom number from the uniform distribution. Further, two modifications of the proposed technique are presented. The first modification assumes that the interviewer also knows the generated random number. The second modification deals with the issue that, for certain variables, such as income, it may be embarrassing for the respondents to report either high or low values. Thus, depending on the value of the pseudorandom lower bound, the respondent is asked different questions to avoid being embarrassed. The suggested approach is applied in detail to the simple random sampling without replacement, but it can also be applied to many currently used sampling schemes, including cluster sampling, two-stage sampling, etc. Results of simulations illustrate the behavior of the proposed procedure.
翻译:本文提出了一种旨在保护被调查者隐私的新的随机响应技术,旨在估计一个定量特征的人口总数或平均人口数,为被调查者提供了高度的保护,因此他们可能认为这种保护是有利的,并提高了合作意愿。第二次修改涉及某些变量,如收入,可能令被调查者难于报告高值或低值。因此,根据被调查者随机选择的数值,被调查者不知道该数值是否大于(或小于)~数字。对于每一个被调查者来说,这种数字,一种个人阈值,是从统一分布中生成的假冒数字。此外,还提出了两种拟议技术的修改。第一次修改假定被调查者也了解产生的随机数字。第二次修改涉及的问题是,对于某些变量,例如收入,被调查者可能很难报告高值或低值。因此,根据被调查者所选择的伪随机值较低值的价值,被询问不同的问题以避免难堪。建议的办法是在不替换的情况下对简单随机抽样进行详细应用,但也可以对目前使用的取样方法进行两次取样,包括模拟阶段。