Software developers often rely on natural language text that appears in software engineering artifacts to access critical information as they build and work on software systems. For example, developers access requirements documents to understand what to build, comments in source code to understand design decisions, answers to questions on Q&A sites to understand APIs, and so on. To aid software developers in accessing and using this natural language information, software engineering researchers often use techniques from natural language processing. In this paper, we explore whether frame semantics, a general linguistic approach, which has been used on requirements text, can also help address problems that occur when applying lexicon analysis based techniques to text associated with program comprehension activities. We assess the applicability of generic semantic frame parsing for this purpose, and based on the results, we propose SEFrame to tailor semantic frame parsing for program comprehension uses. We evaluate the correctness and robustness of the approach finding that SEFrame is correct in between 73% and 74% of the cases and that it can parse text from a variety of software artifacts used to support program comprehension. We describe how this approach could be used to enhance existing approaches to identify meaning on intention from software engineering texts.
翻译:软件开发者往往依靠软件工程工艺品中出现的自然语言文字来获取关键信息。 例如,开发者在软件系统建设和工作时,往往依靠软件工程工艺品中出现的自然语言文字来获取关键信息。例如,开发者获取要求文件来理解构建什么,在源代码中进行评论以理解设计决定,回答在 ⁇ A 站点上的问题以理解API 等。为了帮助软件开发者获取和使用这种自然语言信息,软件工程研究人员经常使用来自自然语言处理的技术。在本文中,我们探索框架语义(一种通用语言方法,在要求文本中已经使用过)能否帮助解决在将基于词汇的分析技术的文字应用于与程序理解活动相关的文本时出现的问题。我们评估通用语义框架用于此目的的可适用性,并根据结果,我们建议SEFBrame, 来调整用于程序理解用途的语义框架。我们评估SEFrame的正确性和稳健性方法,发现SEFrame在73%到74%的案件中是正确的,并且它能够从支持程序理解的各种软件工艺品中解析出文本。我们打算如何加强现有方法。我们如何用这个方法来鉴别。我们描述了。我们想用这个方法如何加强现有方法。