Stack Overflow provides a means for developers to exchange knowledge. While much previous research on Stack Overflow has focused on questions and answers (Q&A), recent work has shown that discussions in comments also contain rich information. On Stack Overflow, discussions through comments and chat rooms can be tied to questions or answers. In this paper, we conduct an empirical study that focuses on the nature of question discussions. We observe that: (1) Question discussions occur at all phases of the Q&A process, with most beginning before the first answer is received. (2) Both askers and answerers actively participate in question discussions; the likelihood of their participation increases as the number of comments increases. (3) There is a strong correlation between the number of question comments and the question answering time (i.e., more discussed questions receive answers more slowly); also, questions with a small number of comments are likely to be answered more quickly than questions with no discussion. Our findings suggest that question discussions contain a rich trove of data that is integral to the Q&A processes on Stack Overflow. We further suggest how future research can leverage the information in question discussions, along with the commonly studied Q&A information.
翻译:虽然先前关于Stack overflow的很多研究侧重于问答,但最近的工作表明,评论中的讨论也包含丰富的信息。关于Stack overflow,通过评论和聊天室的讨论可以与问答联系在一起。在本文件中,我们进行了一项经验性研究,侧重于问题讨论的性质。我们注意到:(1) 问题讨论发生在“A”进程的各个阶段,大部分始于收到第一个答复之前。(2) 问者和回答者积极参与了问题讨论;随着评论数量的增加,他们的参与的可能性也随之增加。(3) 问题评论的数量与回答时间(即,讨论较多的问题得到的回答更慢)之间有着密切的相互关系;此外,只有少量意见的问题可能得到更快的回答,而不是没有讨论的问题。我们的调查结果表明,问题讨论包含大量的数据,这是“A”进程关于“Stack overplow”进程的组成部分。我们进一步建议,未来的研究如何利用问题讨论中的信息以及共同研究的“A”信息。