Automated cyber threat detection in computer networks is a major challenge in cybersecurity. The cyber domain has inherent challenges that make traditional machine learning techniques problematic, specifically the need to learn continually evolving attacks through global collaboration while maintaining data privacy, and the varying resources available to network owners. We present a scheme to mitigate these difficulties through an architectural approach using community model sharing with a streaming analytic pipeline. Our streaming approach trains models incrementally as each log record is processed, thereby adjusting to concept drift resulting from changing attacks. Further, we designed a community sharing approach which federates learning through merging models without the need to share sensitive cyber-log data. Finally, by standardizing data and Machine Learning processes in a modular way, we provide network security operators the ability to manage cyber threat events and model sensitivity through community member and analytic method weighting in ways that are best suited for their available resources and data.
翻译:在计算机网络中自动发现网络威胁是网络安全的一大挑战。 网络领域存在固有的挑战,使传统机器学习技术产生问题,具体而言,需要通过全球协作学习不断演变的袭击,同时维护数据隐私,以及网络拥有者可获得的各种资源。我们提出了一个计划,通过建筑方法,利用社区模式共享与流水分析管道共享来缓解这些困难。我们的流程方法在处理每个日志记录时逐步培训模型,从而适应不断变化的袭击造成的概念漂移。此外,我们设计了一种社区共享方法,通过合并模型来联合学习,而无需分享敏感的网络记录数据。最后,通过模块化的数据和机器学习进程,我们为网络安全操作者提供了管理网络威胁事件和模型敏感性的能力,通过社区成员和分析方法,以最适合其现有资源和数据的方式进行模型的权衡。