SK-Tree:通过签名内核在树流中系统检测恶意软件的算法 (SK-Tree: a systematic malware detection algorithm on streaming trees via the signature kernel) - 专知论文

会员服务 ·

0

流 · 核化 · 学成 · Machine Learning · Continuity ·

2021 年 9 月 29 日

SK-Tree: a systematic malware detection algorithm on streaming trees via the signature kernel

翻译：SK-Tree:通过签名内核在树流中系统检测恶意软件的算法

Thomas Cochrane,Peter Foster,Varun Chhabra,Maud Lemercier,Cristopher Salvi,Terry Lyons

from arxiv, Published at IEEE-CSR (International Conference on Cybersecurity and Resilience) 2021

The development of machine learning algorithms in the cyber security domain has been impeded by the complex, hierarchical, sequential and multimodal nature of the data involved. In this paper we introduce the notion of a streaming tree as a generic data structure encompassing a large portion of real-world cyber security data. Starting from host-based event logs we represent computer processes as streaming trees that evolve in continuous time. Leveraging the properties of the signature kernel, a machine learning tool that recently emerged as a leading technology for learning with complex sequences of data, we develop the SK-Tree algorithm. SK-Tree is a supervised learning method for systematic malware detection on streaming trees that is robust to irregular sampling and high dimensionality of the underlying streams. We demonstrate the effectiveness of SK-Tree to detect malicious events on a portion of the publicly available DARPA OpTC dataset, achieving an AUROC score of 98%.

翻译：网络安全领域机器学习算法的发展由于所涉数据的复杂性、等级性、顺序性和多式联运性质而受到阻碍。在本文中,我们引入了流树概念,作为包含大量真实世界网络安全数据的通用数据结构。从基于主机的事件日志开始,我们将计算机过程作为不断演化的流树来代表。利用签字内核的特性,这是最近作为以复杂数据序列进行学习的领先技术而出现的机器学习工具,我们开发了SK-Tree算法。SK-Tree是一种监督的学习方法,用于在流树上系统检测恶意软件,对于不规则采样和深层流的高度维度是很强的。我们展示了SK-Tree在可公开获取的DARPA OpTC数据集中的一部分检测恶意事件的有效性,达到了98 %的AUROC分数。

0

相关内容

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

专知会员服务

27+阅读 · 2020年7月24日

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

专知会员服务

21+阅读 · 2020年6月4日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

时序数据异常检测工具/数据集大列表

时序数据异常检测工具/数据集大列表

极市平台

65+阅读 · 2019年2月23日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

清华大学研究生教育

3+阅读 · 2018年6月30日

AEVA: Black-box Backdoor Detection Using Adversarial Extreme Value Analysis

Arxiv

0+阅读 · 2021年11月21日

Federated Learning for Malware Detection in IoT Devices

Arxiv

0+阅读 · 2021年11月19日

Sketch-based Creativity Support Tools using Deep Learning

Arxiv

0+阅读 · 2021年11月19日

Recommender systems based on graph embedding techniques: A comprehensive review

Arxiv

23+阅读 · 2021年9月20日

AttentionFlow: Visualising Influence in Networks of Time Series

Arxiv

9+阅读 · 2021年2月3日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Less is More: Learning Highlight Detection from Video Duration

Less is More: Learning Highlight Detection from Video Duration

Arxiv

7+阅读 · 2019年3月3日

Android Malware Detection using Large-scale Network Representation Learning

Arxiv

4+阅读 · 2018年12月11日

Adaptive Neural Trees

Adaptive Neural Trees

Arxiv

4+阅读 · 2018年12月10日

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

Arxiv

13+阅读 · 2018年9月6日

VIP会员

文章信息

相关主题

Machine Learning

相关VIP内容

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

专知会员服务

27+阅读 · 2020年7月24日

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

神经网络与形式语言综述，12页pdf，A Survey of Neural Networks and Formal Languages

专知会员服务

21+阅读 · 2020年6月4日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

时序数据异常检测工具/数据集大列表

时序数据异常检测工具/数据集大列表

极市平台

65+阅读 · 2019年2月23日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

已删除

清华大学研究生教育

3+阅读 · 2018年6月30日

相关论文

AEVA: Black-box Backdoor Detection Using Adversarial Extreme Value Analysis

Arxiv

0+阅读 · 2021年11月21日

Federated Learning for Malware Detection in IoT Devices

Arxiv

0+阅读 · 2021年11月19日

Sketch-based Creativity Support Tools using Deep Learning

Arxiv

0+阅读 · 2021年11月19日

Recommender systems based on graph embedding techniques: A comprehensive review

Arxiv

23+阅读 · 2021年9月20日

AttentionFlow: Visualising Influence in Networks of Time Series

Arxiv

9+阅读 · 2021年2月3日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Less is More: Learning Highlight Detection from Video Duration

Less is More: Learning Highlight Detection from Video Duration

Arxiv

7+阅读 · 2019年3月3日

Android Malware Detection using Large-scale Network Representation Learning

Arxiv

4+阅读 · 2018年12月11日

Adaptive Neural Trees

Adaptive Neural Trees

Arxiv

4+阅读 · 2018年12月10日

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

Arxiv

13+阅读 · 2018年9月6日

微信扫码咨询专知VIP会员