DroidBot-GPT: 基于 GPT 的 Android UI 自动化 (DroidBot-GPT: GPT-powered UI Automation for Android) - 专知论文

会员服务 ·

0

android · 操作 · 自动化 · 自然语言描述 · 范例 ·

2023 年 4 月 14 日

DroidBot-GPT: GPT-powered UI Automation for Android

翻译：DroidBot-GPT: 基于 GPT 的 Android UI 自动化

Hao Wen,Hongming Wang,Jiaxuan Liu,Yuanchun Li

from arxiv, 8 pages, 5 figures

This paper introduces DroidBot-GPT, a tool that utilizes GPT-like large language models (LLMs) to automate the interactions with Android mobile applications. Given a natural language description of a desired task, DroidBot-GPT can automatically generate and execute actions that navigate the app to complete the task. It works by translating the app GUI state information and the available actions on the smartphone screen to natural language prompts and asking the LLM to make a choice of actions. Since the LLM is typically trained on a large amount of data including the how-to manuals of diverse software applications, it has the ability to make reasonable choices of actions based on the provided information. We evaluate DroidBot-GPT with a self-created dataset that contains 33 tasks collected from 17 Android applications spanning 10 categories. It can successfully complete 39.39% of the tasks, and the average partial completion progress is about 66.76%. Given the fact that our method is fully unsupervised (no modification required from both the app and the LLM), we believe there is great potential to enhance automation performance with better app development paradigms and/or custom model training.

翻译：---- 本文介绍了 DroidBot-GPT，一种利用类似于 GPT 的大型语言模型 (LLMs) 自动操作 Android 移动应用程序的工具。给定所需任务的自然语言描述，DroidBot-GPT 可以自动生成并执行操作，以导航应用程序并完成任务。它通过将应用 GUI 状态信息和智能手机屏幕上的可用操作转化为自然语言提示，然后要求 LLM 根据提供的信息进行操作选择。由于 LLM 通常是在包括不同软件应用程序的使用手册在内的大量数据上进行训练的，因此它具有根据所提供的信息做出合理操作选择的能力。我们使用自己创建的数据集对 DroidBot-GPT 进行评估，该数据集包含来自 10 个类别的 17 个 Android 应用程序的 33 个任务。它能够成功完成 39.39% 的任务，并且平均部分完成进度约为 66.76%。鉴于我们的方法完全无监督（不需要修改应用程序和 LLM），我们认为可以通过更好的应用程序开发范例和/或自定义模型训练来提高自动化性能。

2

相关内容

android

【2023新书】《ChatGPT入门》，179页pdf

【2023新书】《ChatGPT入门》，179页pdf

专知会员服务

259+阅读 · 2023年5月30日

大模型最权威课程！MIT最新《生成式AI-大模型》课程，MIT斯坦福OpenAI-DeepMind众多专家讲授

大模型最权威课程！MIT最新《生成式AI-大模型》课程，MIT斯坦福OpenAI-DeepMind众多专家讲授

专知会员服务

121+阅读 · 2023年5月26日

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

专知会员服务

40+阅读 · 2022年10月10日

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

专知会员服务

73+阅读 · 2022年3月24日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

【新书】数据科学编程傻瓜式教程，数据科学编程一体机 | Data Science Programming All-In-One For Dummies

【新书】数据科学编程傻瓜式教程，数据科学编程一体机 | Data Science Programming All-In-One For Dummies

专知会员服务

41+阅读 · 2020年1月22日

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

专知会员服务

69+阅读 · 2020年1月2日

【Google论文强烈推荐】ALBERT:基于精简BERT的自我监督学习的语言表示，ALBERT: A Lite BERT for Self-Supervised Learning of Language Representations

【Google论文强烈推荐】ALBERT:基于精简BERT的自我监督学习的语言表示，ALBERT: A Lite BERT for Self-Supervised Learning of Language Representations

专知会员服务

24+阅读 · 2019年12月21日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

Android 和 iOS 平台能打通？Google 发布全新跨平台工具包

Android 和 iOS 平台能打通？Google 发布全新跨平台工具包

CSDN

0+阅读 · 2022年8月29日

【2022新书】Python DevOps，245页pdf

【2022新书】Python DevOps，245页pdf

专知

6+阅读 · 2022年7月11日

Android Studio Chipmunk 现已发布

Android Studio Chipmunk 现已发布

谷歌开发者

0+阅读 · 2022年6月28日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

自动文摘AS知识资料全集（入门/进阶/代码/数据/专家等）(附pdf下载)

自动文摘AS知识资料全集（入门/进阶/代码/数据/专家等）(附pdf下载)

机器学习研究会

11+阅读 · 2017年11月7日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

无人驾驶车辆智能测试评估与环境设计

国家自然科学基金

27+阅读 · 2014年12月31日

室内眩光的视觉模型研究

国家自然科学基金

2+阅读 · 2014年12月31日

在线社交网络上恶意网址的实时预警

国家自然科学基金

2+阅读 · 2014年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向密集移动标签的RFID敏感信息交互机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

现代藏文自动校对研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于认知模式与信息搜寻的驾驶员出行信息环境优化及仿真

国家自然科学基金

0+阅读 · 2012年12月31日

计算力学基本计算及可视化工具程序包的开发与集成

国家自然科学基金

2+阅读 · 2012年12月31日

基于模型的测试用例优化生成与自动执行

国家自然科学基金

0+阅读 · 2011年12月31日

无线通信网络多信道协议的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Neuron to Graph: Interpreting Language Model Neurons at Scale

Arxiv

0+阅读 · 2023年5月31日

On the Power of Foundation Models

Arxiv

1+阅读 · 2023年5月31日

Practical PCG Through Large Language Models

Arxiv

0+阅读 · 2023年5月31日

PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning

Arxiv

0+阅读 · 2023年5月31日

User Driven Functionality Deletion for Mobile Apps

Arxiv

0+阅读 · 2023年5月30日

Analyzing the Performance of GPT-3.5 and GPT-4 in Grammatical Error Correction

Arxiv

0+阅读 · 2023年5月30日

Autonomous GIS: the next-generation AI-powered GIS

Arxiv

0+阅读 · 2023年5月29日

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

Arxiv

12+阅读 · 2023年4月26日

A Survey of Large Language Models

A Survey of Large Language Models

Arxiv

473+阅读 · 2023年3月31日

Data-Free Knowledge Transfer: A Survey

Arxiv

21+阅读 · 2021年12月31日

VIP会员

文章信息

相关主题

自然语言描述

相关VIP内容

【2023新书】《ChatGPT入门》，179页pdf

【2023新书】《ChatGPT入门》，179页pdf

专知会员服务

259+阅读 · 2023年5月30日

大模型最权威课程！MIT最新《生成式AI-大模型》课程，MIT斯坦福OpenAI-DeepMind众多专家讲授

大模型最权威课程！MIT最新《生成式AI-大模型》课程，MIT斯坦福OpenAI-DeepMind众多专家讲授

专知会员服务

121+阅读 · 2023年5月26日

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

专知会员服务

40+阅读 · 2022年10月10日

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

【新书】【Metalearning】自动机器学习和数据挖掘的应用，Applications to Automated Machine Learning and Data Mining

专知会员服务

73+阅读 · 2022年3月24日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

【新书】数据科学编程傻瓜式教程，数据科学编程一体机 | Data Science Programming All-In-One For Dummies

【新书】数据科学编程傻瓜式教程，数据科学编程一体机 | Data Science Programming All-In-One For Dummies

专知会员服务

41+阅读 · 2020年1月22日

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

2019年自然语言处理NLP亮点总结，29页pdf，NLP Year in Review — 2019 NLP highlights for the year 2019.

专知会员服务

69+阅读 · 2020年1月2日

【Google论文强烈推荐】ALBERT:基于精简BERT的自我监督学习的语言表示，ALBERT: A Lite BERT for Self-Supervised Learning of Language Representations

【Google论文强烈推荐】ALBERT:基于精简BERT的自我监督学习的语言表示，ALBERT: A Lite BERT for Self-Supervised Learning of Language Representations

专知会员服务

24+阅读 · 2019年12月21日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《生成式人工智能与大/小语言模型在供应链管理决策优化与可持续性提升中的作用评估》最新51页

白宫发布《赢得AI竞赛：美国人工智能行动计划》最新28页

地下战：地下空间的战略博弈

《美地下作战条令手册》228页

相关资讯

Android 和 iOS 平台能打通？Google 发布全新跨平台工具包

Android 和 iOS 平台能打通？Google 发布全新跨平台工具包

CSDN

0+阅读 · 2022年8月29日

【2022新书】Python DevOps，245页pdf

【2022新书】Python DevOps，245页pdf

专知

6+阅读 · 2022年7月11日

Android Studio Chipmunk 现已发布

Android Studio Chipmunk 现已发布

谷歌开发者

0+阅读 · 2022年6月28日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

自动文摘AS知识资料全集（入门/进阶/代码/数据/专家等）(附pdf下载)

自动文摘AS知识资料全集（入门/进阶/代码/数据/专家等）(附pdf下载)

机器学习研究会

11+阅读 · 2017年11月7日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

Neuron to Graph: Interpreting Language Model Neurons at Scale

Arxiv

0+阅读 · 2023年5月31日

On the Power of Foundation Models

Arxiv

1+阅读 · 2023年5月31日

Practical PCG Through Large Language Models

Arxiv

0+阅读 · 2023年5月31日

PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning

Arxiv

0+阅读 · 2023年5月31日

User Driven Functionality Deletion for Mobile Apps

Arxiv

0+阅读 · 2023年5月30日

Analyzing the Performance of GPT-3.5 and GPT-4 in Grammatical Error Correction

Arxiv

0+阅读 · 2023年5月30日

Autonomous GIS: the next-generation AI-powered GIS

Arxiv

0+阅读 · 2023年5月29日

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

Arxiv

12+阅读 · 2023年4月26日

A Survey of Large Language Models

A Survey of Large Language Models

Arxiv

473+阅读 · 2023年3月31日

Data-Free Knowledge Transfer: A Survey

Arxiv

21+阅读 · 2021年12月31日

相关基金

无人驾驶车辆智能测试评估与环境设计

国家自然科学基金

27+阅读 · 2014年12月31日

室内眩光的视觉模型研究

国家自然科学基金

2+阅读 · 2014年12月31日

在线社交网络上恶意网址的实时预警

国家自然科学基金

2+阅读 · 2014年12月31日

BER通路基因miRNA结合位点基因多态性与结直肠癌易感性的关联及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向密集移动标签的RFID敏感信息交互机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

现代藏文自动校对研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于认知模式与信息搜寻的驾驶员出行信息环境优化及仿真

国家自然科学基金

0+阅读 · 2012年12月31日

计算力学基本计算及可视化工具程序包的开发与集成

国家自然科学基金

2+阅读 · 2012年12月31日

基于模型的测试用例优化生成与自动执行

国家自然科学基金

0+阅读 · 2011年12月31日

无线通信网络多信道协议的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员