The vast amount of data produced everyday (so-called 'digital traces') and available nowadays represent a gold mine for the social sciences, especially in a computational context, that allows to fully extract their informational and knowledge value. In the latest years, statistical offices have made efforts to profit from harnessing the potential offered by these new sources of data, with promising results. But how difficult is this integration process? What are the challenges that statistical offices would likely face to profit from new data sources and analytical methods? This chapter will start by setting the scene of the current official statistics system, with a focus on its fundamental principles and dimensions relevant to the use of non-traditional data. It will then present some experiments and proofs of concept in the context of data innovation for official statistics, followed by a discussion on prospective challenges related to sustainable data access, new technical and methodological approaches and effective use of new sources of data.
翻译:每天产生的大量数据(所谓的“数字痕迹”)和现在可得到的大量数据是社会科学的金矿,特别是在计算方面,能够充分提取其信息和知识价值;近年来,统计部门努力利用这些新数据来源提供的潜力,并取得了可喜的成果;但这种整合过程有多困难?统计部门可能从新的数据来源和分析方法中受益,有哪些挑战?本章将首先确定当前官方统计系统的现场,重点是其使用非传统数据的基本原则和层面,然后在官方统计数据创新方面提出一些实验和概念证明,然后讨论与可持续数据获取、新的技术和方法方法方法以及有效利用新数据来源有关的未来挑战。