This contribution argues that Reddit, as a massive, categorized, open-access dataset, can be used to conduct knowledge capture on "almost any topic". The presented analysis is based on 180 manually annotated papers related to Reddit and on data acquired from top databases of scientific papers. Moreover, an open-source tool is introduced that provides easy access to Reddit resources and exploratory data analysis of how Reddit covers selected topics.