The modern growth in data production is driven by multiple factors, and stakeholders from many sectors contribute to it. Although comparing the data volumes handled by different big data players is hard because official figures are lacking, this report attempts to reconstruct the yearly orders of magnitude generated by some of the most important organizations by mining several online sources. For each big data source considered, the estimate starts from a meaningful unit measure of data production retrieved from those sources; the yearly amount is then obtained by conjecturing a reasonable per-unit size. The final result is summarized as a bubble plot.
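The estimation procedure described in the abstract amounts to multiplying a yearly unit count by a conjectured per-unit size and reading off the order of magnitude. The following is a minimal sketch of that arithmetic, not the report's actual code: the `Source` class, the function names, and every numeric value are hypothetical placeholders chosen only for illustration, and none of them are figures from the report.

```python
import math
from dataclasses import dataclass


@dataclass
class Source:
    """One big data source (all fields are illustrative assumptions)."""
    name: str
    units_per_year: float  # hypothetical count of units produced per year
    bytes_per_unit: float  # conjectured average size of one unit, in bytes


def yearly_bytes(source: Source) -> float:
    """Yearly volume = number of units per year * assumed size per unit."""
    return source.units_per_year * source.bytes_per_unit


def order_of_magnitude(n_bytes: float) -> int:
    """Base-10 order of magnitude of a positive byte count."""
    return math.floor(math.log10(n_bytes))


# Placeholder inputs only; these are NOT estimates from the report.
sources = [
    Source("source_a", units_per_year=1e9, bytes_per_unit=2e6),
    Source("source_b", units_per_year=5e11, bytes_per_unit=1e3),
]

for s in sources:
    total = yearly_bytes(s)
    print(f"{s.name}: about 10^{order_of_magnitude(total)} bytes per year")
```

In a bubble plot, each value returned by `yearly_bytes` would typically set the area of one bubble, so sources that differ by several orders of magnitude remain visually comparable.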