Question answering platforms, such as Stack Overflow, have impacted substantially how developers search for solutions for their programming problems. The crowd knowledge content available from such platforms has also been used to leverage software development tools. The recent advances on Natural Language Processing, specifically on more powerful language models, have demonstrated ability to enhance text understanding and generation. In this context, we aim at investigating the factors that can influence on the application of such models for understanding source code related data and produce more interactive and intelligent assistants for software development. In this preliminary study, we particularly investigate if a how-to question filter and the level of context in the question may impact the results of a question answering transformer-based model. We suggest that fine-tuning models with corpus based on how-to questions can impact positively in the model and more contextualized questions also induce more objective answers.
翻译:Stack overflow等问答平台对开发者如何寻找解决其编程问题的办法产生了重大影响。这些平台提供的人群知识内容也被用于利用软件开发工具。最近有关自然语言处理的进展,特别是更强大的语言模型的进展,表明有能力增进对文本的理解和生成。在这方面,我们的目标是调查对应用这些模型有影响的因素,以了解源代码相关数据,并为软件开发提供更互动和智能的助手。在本初步研究中,我们特别调查一个如何质疑过滤和问题的背景程度是否会影响回答变压器模型的问题的结果。我们建议,基于问题对质的微调模型可以在模型中产生积极的影响,而更符合背景的问题也会产生更客观的答案。