Stakeholders in software projects use issue trackers like JIRA to capture and manage issues, including requirements and bugs. To ease issue navigation and structure project knowledge, stakeholders manually connect issues via links of certain types that reflect different dependencies, such as Epic-, Block-, Duplicate-, or Relate- links. Based on a large dataset of 15 JIRA repositories, we study how well state-of-the-art machine learning models can automatically detect common link types. We found that a pure BERT model trained on titles and descriptions of linked issues significantly outperforms other optimized deep learning models, achieving an encouraging average macro F1-score of 0.64 for detecting 9 popular link types across all repositories (weighted F1-score of 0.73). For the specific Subtask- and Epic- links, the model achieved top F1-scores of 0.89 and 0.97, respectively. Our model does not simply learn the textual similarity of the issues. In general, shorter issue text seems to improve the prediction accuracy with a strong negative correlation of -0.70. We found that Relate-links often get confused with the other links, which suggests that they are likely used as default links in unclear cases. We also observed significant differences across the repositories, depending on how they are used and by whom.
翻译:软件项目的利益攸关方使用JIRA这样的问题跟踪器来捕捉和管理问题,包括要求和错误。为了方便问题导航和结构项目知识,利益攸关方通过反映不同依赖性的某些类型(如Epic-、Block-、Dopreplication-或对应链接)的链接手工连接问题。根据15个JIRA储存库的庞大数据集,我们研究的是,最先进的机器学习模型如何能够自动检测共同链接类型。我们发现,在相关议题的标题和描述方面受过培训的纯BERT模型大大优于其他优化的深层学习模型,在发现所有储存库9种广受欢迎的链接类型(加权F1-序号为0.73)方面实现了令人鼓舞的平均F1-核心(0.64)。对于具体的Subtask-和Epic-链接而言,模型分别实现了0.89和0.97的顶级F1-核心。我们的模型并不单纯地了解问题的文字相似性。我们发现,一般而言,较短的问题文本似乎提高了预测准确性,而其它最优化的深层次的深层次的学习模型比-0.70。我们发现,我们发现,相互对应的链接往往会混淆于其他链接。