Behavioural Reports of Multi-Stage Malware - 专知 (Zhuanzhi) Papers


30 January 2023

Behavioural Reports of Multi-Stage Malware


Marcus Carpenter, Chunbo Luo

The extensive damage caused by malware requires anti-malware systems to be constantly improved to counter new threats. The current trend in malware detection is to employ machine learning models to aid the classification process. We propose a new dataset with the objective of improving current anti-malware systems. The dataset targets host-based intrusion detection systems by providing API call sequences for thousands of malware samples executed in Windows 10 virtual machines. A tutorial on how to create and expand the dataset is provided, along with a benchmark demonstrating how to use it to classify malware. The data contains a long sequence of API calls for each sample, and three feature selection methods were tested in order to create models that can be deployed on resource-constrained devices. The principal innovation, however, lies in the multi-label classification system, in which one sequence of API calls can be tagged with multiple labels describing its malicious behaviours.
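To make the multi-label idea concrete, the following is a minimal sketch and not the authors' pipeline. It assumes each sample is a list of API call names paired with a set of behaviour labels; the label names (`dropper`, `downloader`, `persistence`), the toy sequences, and the document-frequency selection rule are all hypothetical stand-ins, since the abstract does not name the three feature selection methods that were tested.

```python
from collections import Counter


def multi_hot(labels, vocabulary):
    """Encode a set of behaviour labels as a 0/1 vector over a fixed label vocabulary."""
    return [1 if name in labels else 0 for name in vocabulary]


def top_k_calls(sequences, k):
    """Select the k API names that occur in the most samples (document frequency).

    This is an illustrative selection rule, not one of the paper's methods.
    """
    doc_freq = Counter()
    for seq in sequences:
        doc_freq.update(set(seq))  # count each API at most once per sample
    # Sort by descending frequency, then by name, so ties break deterministically.
    ranked = sorted(doc_freq.items(), key=lambda item: (-item[1], item[0]))
    return [name for name, _ in ranked[:k]]


def count_features(seq, selected):
    """Turn one API call sequence into call counts over the selected API names."""
    counts = Counter(seq)
    return [counts[name] for name in selected]


# Hypothetical toy data: (API call sequence, set of behaviour labels).
samples = [
    (["CreateFileW", "WriteFile", "RegSetValueExW", "WriteFile"], {"dropper", "persistence"}),
    (["CreateFileW", "InternetOpenW", "WriteFile"], {"dropper", "downloader"}),
    (["RegSetValueExW", "CreateFileW"], {"persistence"}),
]

label_vocab = sorted({label for _, labels in samples for label in labels})
selected = top_k_calls([seq for seq, _ in samples], k=3)

X = [count_features(seq, selected) for seq, _ in samples]  # reduced feature matrix
Y = [multi_hot(labels, label_vocab) for _, labels in samples]  # one row may hold several 1s
```

Each row of `Y` can contain more than one `1`, which is what distinguishes this setup from single-label family classification. Restricting the features to a small set of selected API names is one way to keep the input to a downstream model compact for constrained devices.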


