Automated understanding of user interfaces (UIs) from their pixels can improve accessibility, enable task automation, and facilitate interface design without relying on developers to comprehensively provide metadata. A first step is to infer what UI elements exist on a screen, but current approaches offer limited support for inferring how those elements are semantically grouped into structured interface definitions. In this paper, we motivate the problem of screen parsing: the task of predicting UI elements and their relationships from a screenshot. We describe our implementation of screen parsing and provide an effective training procedure that optimizes its performance. In an evaluation comparing the accuracy of the generated output, we find that our implementation significantly outperforms current systems (by up to 23%). Finally, we show three example applications that are facilitated by screen parsing: (i) UI similarity search, (ii) accessibility enhancement, and (iii) code generation from UI screenshots.