In practice, developers search for related earlier bugs and their associated discussion threads when faced with a new bug to repair. Typically, these discussion threads consist of comments and even bug-fixing comments intended to capture clues for facilitating the investigation and root cause of a new bug report. Over time, these discussions can become extensively lengthy and difficult to understand. Inevitably, these discussion threads lead to instances where bug-fixing comments intermingle with seemingly-unrelated comments. This task, however, poses further challenges when dealing with high volumes of bug reports. Large software systems are plagued by thousands of bug reports daily. Hence, it becomes time-consuming to investigate these bug reports efficiently. To address this gap, this paper builds a ranked-based automated tool that we refer it to as RETRORANK. Specifically, RETRORANK recommends bug-fixing comments from issue tracking discussion threads in the context of user query relevance, the use of positive language, and semantic relevance among comments. By using a combination of Vector Space Model (VSM), Sentiment Analysis (SA), and the TextRank Model (TR) we show how that past fixed bugs and their associated bug-fixing comments with relatively positive sentiments can semantically connect to investigate the root cause of a new bug. We evaluated our approach via a synthetic study and a user study. Results indicate that RETRORANK significantly improved performance when compared to the baseline VSM.
翻译:在实践中, 开发者在遇到新错误需要修理时会寻找相关的早期错误及其相关讨论线索。 通常, 这些讨论线索包括评论, 甚至错误修正评论, 目的是收集线索, 便利新错误报告的调查和根本原因。 随着时间的推移, 这些讨论会变得广泛冗长且难以理解。 这些讨论线索不可避免地会导致错误修正评论与似乎无关的评论交织在一起。 然而, 这项任务在处理大量错误报告时会带来进一步的挑战。 大型软件系统每天受到成千上万个错误报告的困扰。 因此, 有效调查这些错误报告将耗时很多。 为了弥补这一差距, 本文将建立一个基于排名的自动化工具, 我们把它称为 RETROANK 。 具体地说, RETRANK 建议在用户查询相关性、 使用正面语言, 以及评论之间的语义关联性。 使用VCtort Smode 模型( VSM) 、 SentiRO 分析(SA) 和 TextRANK 模型(TRANK ) 将一个我们如何通过错误和错误模型进行正确性分析, 显示我们如何进行新的分析, 我们如何通过错误和错误性分析。 我们如何用错误分析。