Improving file management interfaces and optimising system performance requires current data about users' digital collections and particularly about the file size distributions of such collections. However, prior works have examined only the sizes of system files and users' work files in varied contexts, and there has been no such study since 2013; it therefore remains unclear how today's file sizes are distributed, particularly personal files, and further if distributions differ among the major operating systems or common occupations. Here we examine such differences among 49 million files in 348 user collections. We find that the average file size has grown more than ten-fold since the mid-2000s, though most files are still under 8 MB, and that there are demographic and technological influences in the size distributions. We discuss the implications for user interfaces, system optimisation, and PIM research.
翻译:改进文件管理界面和优化系统性能需要关于用户数字收藏的最新数据,特别是关于这类收藏的文件规模分布的数据,然而,以前的工作只审查了系统文档的规模和用户在不同情况下的工作文件,自2013年以来没有进行这种研究;因此,目前的文件规模,特别是个人档案的分布方式仍然不清楚,如果主要操作系统或共同职业之间的分布方式不同,更不清楚。我们在这里审查348个用户收藏中4 900万个文件之间的这种差异。我们发现,自2000年代中期以来,平均文件规模已经增加了十倍以上,尽管大多数文件仍然在8 MB之下,而且在规模分布方面存在着人口和技术影响。我们讨论了对用户界面、系统优化和PIM研究的影响。