Case-hindering, multi-year digital forensic evidence backlogs have become commonplace in law enforcement agencies throughout the world. This is due to an ever-growing number of cases requiring digital forensic investigation coupled with the growing volume of data to be processed per case. Leveraging previously processed digital forensic cases and their component artefact relevancy classifications can facilitate an opportunity for training automated artificial intelligence based evidence processing systems. These can significantly aid investigators in the discovery and prioritisation of evidence. This paper presents one approach for file artefact relevancy determination building on the growing trend towards a centralised, Digital Forensics as a Service (DFaaS) paradigm. This approach enables the use of previously encountered pertinent files to classify newly discovered files in an investigation. Trained models can aid in the detection of these files during the acquisition stage, i.e., during their upload to a DFaaS system. The technique generates a relevancy score for file similarity using each artefact's filesystem metadata and associated timeline events. The approach presented is validated against three experimental usage scenarios.
翻译:世界各地执法机构经常出现多年度数字法证积压案件,这是因为需要数字法证调查的案件越来越多,而且每起案件需要处理的数据数量越来越多。利用以前处理的数字法证案件及其组成部分的亚素化相关性分类,可以促进培训自动化人工智能证据处理系统的机会。这些可以极大地帮助调查人员发现证据并确定证据的先后次序。本文件介绍了一种办法,用以在日益趋向集中的、数字法证服务模式的基础上确定档案的切片相关性。这种办法使得能够利用以前遇到的有关档案对调查中新发现的档案进行分类。经过培训的模型可以帮助在获取阶段,即上传到DFAAS系统期间,发现这些档案。这种技术利用每种艺术法证的档案系统元数据和相关时间事件,为相似的档案制作了比值。根据三种试验性使用假设,对所采用的方法进行了验证。