屏幕识别:从像素创建移动应用无障碍元数据 (Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels) - 专知论文

会员服务 ·

0

VoiceOver · MoDELS · 稳健性 · FAST · 全 ·

2021 年 1 月 13 日

Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels

翻译：屏幕识别:从像素创建移动应用无障碍元数据

Xiaoyi Zhang,Lilian de Greef,Amanda Swearngin,Samuel White,Kyle Murray,Lisa Yu,Qi Shan,Jeffrey Nichols,Jason Wu,Chris Fleizach,Aaron Everitt,Jeffrey P. Bigham

Many accessibility features available on mobile platforms require applications (apps) to provide complete and accurate metadata describing user interface (UI) components. Unfortunately, many apps do not provide sufficient metadata for accessibility features to work as expected. In this paper, we explore inferring accessibility metadata for mobile apps from their pixels, as the visual interfaces often best reflect an app's full functionality. We trained a robust, fast, memory-efficient, on-device model to detect UI elements using a dataset of 77,637 screens (from 4,068 iPhone apps) that we collected and annotated. To further improve UI detections and add semantic information, we introduced heuristics (e.g., UI grouping and ordering) and additional models (e.g., recognize UI content, state, interactivity). We built Screen Recognition to generate accessibility metadata to augment iOS VoiceOver. In a study with 9 screen reader users, we validated that our approach improves the accessibility of existing mobile apps, enabling even previously inaccessible apps to be used.

翻译：移动平台上的许多无障碍功能要求应用程序(应用程序)提供完整和准确的元数据,描述用户界面(UI)组件。不幸的是,许多应用程序没有提供足够的元数据,以提供预期的无障碍功能。在本文件中,我们探索从其像素中推断移动应用程序的无障碍元数据,因为视觉界面通常最能反映应用程序的全部功能。我们训练了一个强大、快速、记忆高效的在线设计模型,用我们收集的77 637个屏幕(4 068 iPhone Apps)和附加说明的数据集检测UI元素。为了进一步改进对界面的检测和添加语义信息,我们引入了超常(如UIUI分组和订购)和其他模型(如识别UI的内容、状态、互动性)。我们建立了屏幕认知,以生成无障碍元数据,增强iOS语音。在与9个屏幕阅读器用户进行的一项研究中,我们确认我们的方法改善了现有移动应用程序的无障碍性,使以前无法访问的应用程序得以使用。

0

相关内容

VoiceOver

VoiceOver

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【深度学习视频分析/多模态学习资源大列表】

【深度学习视频分析/多模态学习资源大列表】

专知会员服务

92+阅读 · 2019年10月16日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

将门创投

5+阅读 · 2018年11月27日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

IEEE2018|An Accurate and Real-time 3D Tracking System for Robots

IEEE2018|An Accurate and Real-time 3D Tracking System for Robots

极市平台

4+阅读 · 2018年4月19日

机器人开发库软件大列表

机器人开发库软件大列表

专知

10+阅读 · 2018年3月18日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

机器学习研究会

9+阅读 · 2017年10月24日

【推荐】SLAM相关资源大列表

【推荐】SLAM相关资源大列表

机器学习研究会

10+阅读 · 2017年8月18日

ANDRUSPEX : Leveraging Graph Representation Learning to Predict Harmful App Installations on Mobile Devices

Arxiv

0+阅读 · 2021年3月10日

PROVED: A Tool for Graph Representation and Analysis of Uncertain Event Data

PROVED: A Tool for Graph Representation and Analysis of Uncertain Event Data

Arxiv

0+阅读 · 2021年3月9日

ANDR USPEX : Leveraging Graph Representation Learning to Predict Harmful App Installations on Mobile Devices

Arxiv

0+阅读 · 2021年3月9日

V2S: A Tool for Translating Video Recordings of Mobile App Usages into Replayable Scenarios

Arxiv

0+阅读 · 2021年3月8日

Learn to Differ: Sim2Real Small Defection Segmentation Network

Arxiv

0+阅读 · 2021年3月7日

DeepFakes: a New Threat to Face Recognition? Assessment and Detection

Arxiv

6+阅读 · 2018年12月20日

SlowFast Networks for Video Recognition

SlowFast Networks for Video Recognition

Arxiv

19+阅读 · 2018年12月10日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

9+阅读 · 2018年3月14日

Deep Learning Framework for Multi-class Breast Cancer Histology Image Classification

Arxiv

9+阅读 · 2018年2月3日

DAiSEE: Towards User Engagement Recognition in the Wild

Arxiv

7+阅读 · 2017年12月15日

VIP会员

文章信息

相关主题

相关VIP内容

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【深度学习视频分析/多模态学习资源大列表】

【深度学习视频分析/多模态学习资源大列表】

专知会员服务

92+阅读 · 2019年10月16日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《多域空战指挥体系：驾驭复杂性的艺术》

构建军事人工智能信任体系始于破除黑盒机制

《生态建模密码破译：建模与编程实践》美陆军最新报告

《战争形态演变：合成兵种防御主导模式探析》48页slides

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

将门创投

5+阅读 · 2018年11月27日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

IEEE2018|An Accurate and Real-time 3D Tracking System for Robots

IEEE2018|An Accurate and Real-time 3D Tracking System for Robots

极市平台

4+阅读 · 2018年4月19日

机器人开发库软件大列表

机器人开发库软件大列表

专知

10+阅读 · 2018年3月18日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

机器学习研究会

9+阅读 · 2017年10月24日

【推荐】SLAM相关资源大列表

【推荐】SLAM相关资源大列表

机器学习研究会

10+阅读 · 2017年8月18日

相关论文

ANDRUSPEX : Leveraging Graph Representation Learning to Predict Harmful App Installations on Mobile Devices

Arxiv

0+阅读 · 2021年3月10日

PROVED: A Tool for Graph Representation and Analysis of Uncertain Event Data

PROVED: A Tool for Graph Representation and Analysis of Uncertain Event Data

Arxiv

0+阅读 · 2021年3月9日

ANDR USPEX : Leveraging Graph Representation Learning to Predict Harmful App Installations on Mobile Devices

Arxiv

0+阅读 · 2021年3月9日

V2S: A Tool for Translating Video Recordings of Mobile App Usages into Replayable Scenarios

Arxiv

0+阅读 · 2021年3月8日

Learn to Differ: Sim2Real Small Defection Segmentation Network

Arxiv

0+阅读 · 2021年3月7日

DeepFakes: a New Threat to Face Recognition? Assessment and Detection

Arxiv

6+阅读 · 2018年12月20日

SlowFast Networks for Video Recognition

SlowFast Networks for Video Recognition

Arxiv

19+阅读 · 2018年12月10日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

9+阅读 · 2018年3月14日

Deep Learning Framework for Multi-class Breast Cancer Histology Image Classification

Arxiv

9+阅读 · 2018年2月3日

DAiSEE: Towards User Engagement Recognition in the Wild

Arxiv

7+阅读 · 2017年12月15日

微信扫码咨询专知VIP会员