Software maintenance constitutes a large portion of the software development lifecycle. To carry out maintenance tasks, developers often need to understand and reproduce bug reports. As such, there has been increasing research activity coalescing around the notion of automating various activities related to bug reporting. A sizable portion of this research interest has focused on the domain of mobile apps. However, as research around mobile app bug reporting progresses, there is a clear need for a manually vetted and reproducible set of real-world bug reports that can serve as a benchmark for future work. This paper presents ANDROR2: a dataset of 90 manually reproduced bug reports for Android apps listed on Google Play and hosted on GitHub, systematically collected via an in-depth analysis of 459 reports extracted from the GitHub issue tracker. For each reproduced report, ANDROR2 includes the original bug report, an apk file for the buggy version of the app, an executable reproduction script, and metadata regarding the quality of the reproduction steps associated with the original report. We believe that the ANDROR2 dataset can be used to facilitate research in automatically analyzing, understanding, reproducing, localizing, and fixing bugs for mobile applications as well as other software maintenance activities more broadly.
翻译:软件维护是软件开发生命周期的一大部分。 执行维护任务, 开发者往往需要理解并复制错误报告。 因此, 研究活动越来越多, 围绕与错误报告相关的各种活动自动化概念, 研究兴趣的很大一部分集中在移动应用程序领域。 然而, 围绕移动应用程序错误报告进展的研究显示, 明显需要手动审查并可复制的一套真实世界错误报告, 这组报告可以作为未来工作的基准。 本文展示了 ANDROR2 : 谷歌 Play 上列出并在 GitHub 上托管的 Android Apps 90 个手工复制的错误报告数据集。 通过深入分析从 GitHub 发行跟踪器中提取的459份报告系统收集。 对于每份复制的报告, ANDRO2 包括原始错误报告、 可执行的复制脚本和关于与原始报告相关的复制步骤质量的元件。 我们认为, ANTR2 数据集可以用来促进研究, 进行更广义的移动软件操作, 并更新其他的维护程序。