Artifacts such as log data and network traffic are fundamental for cybersecurity research, e.g., in the area of intrusion detection. Yet, most research is based on artifacts that are not available to others or cannot be adapted to own purposes, thus making it difficult to reproduce and build on existing work. In this paper, we identify the challenges of artifact generation with the goal of conducting sound experiments that are valid, controlled, and reproducible. We argue that testbeds for artifact generation have to be designed specifically with reproducibility and adaptability in mind. To achieve this goal, we present SOCBED, our proof-of-concept implementation and the first testbed with a focus on generating realistic log data for cybersecurity experiments in a reproducible and adaptable manner. SOCBED enables researchers to reproduce testbed instances on commodity computers, adapt them according to own requirements, and verify their correct functionality. We evaluate SOCBED with an exemplary, practical experiment on detecting a multi-step intrusion of an enterprise network and show that the resulting experiment is indeed valid, controlled, and reproducible. Both SOCBED and the log dataset underlying our evaluation are freely available.
翻译:记录数据和网络交通等人工制品是网络安全研究的基础,例如入侵探测领域。然而,大多数研究都以他人无法获取或无法适应自身目的的文物为基础,因此难以复制和利用现有的工作。在本文件中,我们确定人工制品生成的挑战,目的是进行有效、受控和可复制的健全实验。我们认为,人工制品生成的试验台必须专门设计,要具有可复制性和适应性。为了实现这一目标,我们介绍了SOCBED、我们的概念验证实施和第一个测试,重点是以可复制和可调整的方式为网络安全实验生成现实的原始数据。SOCBED使研究人员能够在商品计算机上复制测试过的事例,根据自己的要求加以调整,并核查其正确功能。我们用检测企业网络多步入侵的模范和实用的实验对SOCBED进行了评估,并表明由此产生的实验确实有效、可控和可复制。SOCBED和我们评估所依据的日志数据集是自由可得的。