Stack Overflow hosts valuable programming-related knowledge with 11,926,354 links that reference to the third-party websites. The links that reference to the resources hosted outside the Stack Overflow websites extend the Stack Overflow knowledge base substantially. However, with the rapid development of programming-related knowledge, many resources hosted on the Internet are not available anymore. Based on our analysis of the Stack Overflow data that was released on Jun. 2, 2019, 14.2% of the links on Stack Overflow are broken links. The broken links on Stack Overflow can obstruct viewers from obtaining desired programming-related knowledge, and potentially damage the reputation of the Stack Overflow as viewers might regard the posts with broken links as obsolete. In this paper, we characterize the broken links on Stack Overflow. 65% of the broken links in our sampled questions are used to show examples, e.g., code examples. 70% of the broken links in our sampled answers are used to provide supporting information, e.g., explaining a certain concept and describing a step to solve a problem. Only 1.67% of the posts with broken links are highlighted as such by viewers in the posts' comments. Only 5.8% of the posts with broken links removed the broken links. Viewers cannot fully rely on the vote scores to detect broken links, as broken links are common across posts with different vote scores. The websites that host resources that can be maintained by their users are referenced by broken links the most on Stack Overflow -- a prominent example of such websites is GitHub. The posts and comments related to the web technologies, i.e., JavaScript, HTML, CSS, and jQuery, are associated with more broken links. Based on our findings, we shed lights for future directions and provide recommendations for practitioners and researchers.
翻译:Stack Overflook 提供了11,926,354个与第三方网站相关的重要编程相关知识。 Stack Overflow 网站外的资源链接,大大扩展了 Stack Overflow 知识库。然而,随着与编程相关知识的迅速发展,互联网上的许多资源已不再可用。根据我们对 Stack Overflow 上发布的 Stack Over 数据的分析, Stack Overflow 上链接的11,926,354链接中11,11,926,354个链接。 Stack Over 上的断开链接会妨碍观众获得所希望的编程相关知识,并可能损害 Stack Oververver的声誉,因为查看者可能认为断开链接的Staack Over 数据库已经过时。在本文中,我们抽样问题的65%的断开链接被用来展示实例,例如,20191919年6月2日发布的Stack Overflower 答案中的70%断开的链接被用来提供支持信息,例如,解释某种概念和描述解决问题的一步。只有1.67 % 与断断开的服务器的断断开的链接与Speak 的链接,通过浏览的链接无法通过浏览的链接在S treverview 浏览的链接在浏览的链接在服务器上被显示。