Using mathematical modeling and human subjects experiments, this research explores the extent to which emerging webcams might leak recognizable textual and graphical information gleaming from eyeglass reflections captured by webcams. The primary goal of our work is to measure, compute, and predict the factors, limits, and thresholds of recognizability as webcam technology evolves in the future. Our work explores and characterizes the viable threat models based on optical attacks using multi-frame super resolution techniques on sequences of video frames. Our models and experimental results in a controlled lab setting show it is possible to reconstruct and recognize with over 75% accuracy on-screen texts that have heights as small as 10 mm with a 720p webcam. We further apply this threat model to web textual contents with varying attacker capabilities to find thresholds at which text becomes recognizable. Our user study with 20 participants suggests present-day 720p webcams are sufficient for adversaries to reconstruct textual content on big-font websites. Our models further show that the evolution towards 4K cameras will tip the threshold of text leakage to reconstruction of most header texts on popular websites. Besides textual targets, a case study on recognizing a closed-world dataset of Alexa top 100 websites with 720p webcams shows a maximum recognition accuracy of 94% with 10 participants even without using machine-learning models. Our research proposes near-term mitigations including a software prototype that users can use to blur the eyeglass areas of their video streams. For possible long-term defenses, we advocate an individual reflection testing procedure to assess threats under various settings, and justify the importance of following the principle of least privilege for privacy-sensitive scenarios.
翻译:使用数学模型和人文实验,本研究探索了新兴网络摄像头可能泄漏从网络摄像头摄取的镜像反射镜镜镜镜中可见的可识别文本和图形信息。我们工作的首要目标是测量、计算和预测随着网络摄像头技术在未来演进而可识别的因素、限度和阈值。我们的工作探索和描述基于光学攻击的可行的威胁模型,使用视频框架序列的多框架超级解析技术。我们在受控制的实验室设置中的模型和实验结果显示,有可能以75%以上的准确度在屏幕文本上进行重建和识别,其高度为10毫米,使用720页网络摄像头反射镜。我们进一步将这种威胁模型应用于具有不同攻击能力的网站的网络文本内容,以找到可识别的阈值。我们与20名参与者的用户研究研究表明,今天的纸质720页网络摄像头足以使敌人重建大软体网站的文本内容。我们的模型进一步显示,向4K摄像头的演化过程将使得最接近于最接近的屏幕防御文本流流到最接近的版本的版本的版本,直到720毫米的版本的版本的版本的版本。除了文本测试之外,我们的网站可以使用一个最接近的服务器的服务器的模型,包括一个最接近的模型。