Contact tracing is an effective tool in controlling the spread of infectious diseases such as COVID-19. It involves digital monitoring and recording of physical proximity between people over time with a central and trusted authority, so that when one user reports infection, it is possible to identify all other users who have been in close proximity to that person during a relevant time period in the past and alert them. One way to achieve this involves recording on the server the locations, e.g. by reading and reporting the GPS coordinates of a smartphone, of all users over time. Despite its simplicity, privacy concerns have prevented widespread adoption of this method. Technology that would enable the "hiding" of data could go a long way towards alleviating privacy concerns and enable contact tracing at a very large scale. In this article we describe a general method to hide data. By hiding, we mean that instead of disclosing a data value x, we would disclose an "encoded" version of x, namely E(x), where E(x) is easy to compute but very difficult, from a computational point of view, to invert. We propose a general construction of such a function E and show that it guarantees perfect recall, namely, all individuals who have potentially been exposed to infection are alerted, at the price of an infinitesimal number of false alarms, namely, only a negligible number of individuals who have not actually been exposed will be wrongly informed that they have.
翻译:追踪联系是控制传染性疾病(如COVID-19)传播的有效工具。它涉及在中央和可信任的权威下,对人们之间长期存在的距离进行数字监测和记录,以便当一个用户报告感染时,有可能查明过去某个相关时期与该人关系密切的所有其他用户,并提醒他们注意。实现这一目的的方法之一是在服务器上记录各个地点的“编码”版本,例如阅读和报告所有用户的智能手机GPS坐标。尽管这种系统简单明了,但隐私问题阻碍了广泛采用这种方法。能够“隐藏”数据的技术可以大大有助于减轻隐私问题,并能够大规模追踪联系。在文章中,我们描述了隐藏数据的一般方法。我们的意思是,除了披露数据价值x之外,我们还要在服务器上披露“编码”的X版本,即E(x)易于理解,但从计算角度看,它们很难广泛采用这种方法。我们提议一般地构建这样的“隐藏”数据功能,并表明它能够保证准确无误地提醒所有个人,也就是说,最坏的受感染者,只有最坏的、最坏的、最坏的、最坏的、最有可能受到感染的人。</s>