基于哈希的恶意软件K-Means聚类比较分析 (Comparative Analysis of Hash-based Malware Clustering via K-Means) - 专知论文

会员服务 ·

0

软件 · 哈希 · K-均值 · 分析 · 攻击 ·

Comparative Analysis of Hash-based Malware Clustering via K-Means

翻译：基于哈希的恶意软件K-Means聚类比较分析

Aink Acrie Soe Thein,Nikolaos Pitropakis,Pavlos Papadopoulos,Sam Grierson,Sana Ullah Jan

from arxiv, To be published in the proceedings of the 8th International Conference on Reliable Information and Communication Technology (IRICT 2025). Springer Book Series: "Lecture Notes on Data Engineering and Communications Technologies"

With the adoption of multiple digital devices in everyday life, the cyber-attack surface has increased. Adversaries are continuously exploring new avenues to exploit them and deploy malware. On the other hand, detection approaches typically employ hashing-based algorithms such as SSDeep, TLSH, and IMPHash to capture structural and behavioural similarities among binaries. This work focuses on the analysis and evaluation of these techniques for clustering malware samples using the K-means algorithm. More specifically, we experimented with established malware families and traits and found that TLSH and IMPHash produce more distinct, semantically meaningful clusters, whereas SSDeep is more efficient for broader classification tasks. The findings of this work can guide the development of more robust threat-detection mechanisms and adaptive security mechanisms.

翻译：随着日常生活中多种数字设备的普及，网络攻击面不断扩大。攻击者持续探索新途径以利用这些设备并部署恶意软件。另一方面，检测方法通常采用基于哈希的算法（如SSDeep、TLSH和IMPHash）来捕获二进制文件间的结构与行为相似性。本研究重点分析和评估这些技术在使用K-means算法进行恶意软件样本聚类时的表现。具体而言，我们通过已建立的恶意软件家族和特征进行实验，发现TLSH和IMPHash能产生更具区分度、语义更明确的聚类，而SSDeep在更广泛的分类任务中效率更高。本研究的发现可为开发更鲁棒的威胁检测机制和自适应安全机制提供指导。

0

相关内容

软件（中国大陆及香港用语，台湾作软体，英文：Software）是一系列按照特定顺序组织的计算机数据和指令的集合。一般来讲软件被划分为编程语言、系统软件、应用软件和介于这两者之间的中间件。软件就是程序加文档的集合体。

适应性异常检测在识别网络物理系统攻击中的应用：系统性文献综述

适应性异常检测在识别网络物理系统攻击中的应用：系统性文献综述

专知会员服务

17+阅读 · 2024年11月22日

半监督目标检测：从卷积神经网络（CNN）到 Transformer 的进展综述

半监督目标检测：从卷积神经网络（CNN）到 Transformer 的进展综述

专知会员服务

41+阅读 · 2024年7月12日

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

专知会员服务

108+阅读 · 2020年10月9日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

AAAI 2022 | ProtGNN：自解释图神经网络

AAAI 2022 | ProtGNN：自解释图神经网络

专知

10+阅读 · 2022年2月28日

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

专知

11+阅读 · 2021年4月23日

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

LibRec 每周算法：LDA主题模型

LibRec 每周算法：LDA主题模型

LibRec智能推荐

29+阅读 · 2017年12月4日

软件定义网络（SDN）环境下基于机器学习的路由预规划研究

国家自然科学基金

4+阅读 · 2015年12月31日

抗密钥篡改可证明安全公钥密码算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向Bug报告的软件故障重现方法研究

国家自然科学基金

4+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

基于安全需求分析的内核保护方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

LLM-Driven Feature-Level Adversarial Attacks on Android Malware Detectors

Arxiv

0+阅读 · 12月24日

OW-Rep: Open World Object Detection with Instance Representation Learning

Arxiv

0+阅读 · 12月21日

Self-Supervised Learning of Graph Representations for Network Intrusion Detection

Arxiv

0+阅读 · 12月20日

Cyber Risk Scoring with QUBO: A Quantum and Hybrid Benchmark Study

Arxiv

0+阅读 · 12月20日

Methods and Tools for Secure Quantum Clouds with a specific Case Study on Homomorphic Encryption

Arxiv

0+阅读 · 12月19日

VIP会员

文章信息

相关主题

相关VIP内容

适应性异常检测在识别网络物理系统攻击中的应用：系统性文献综述

适应性异常检测在识别网络物理系统攻击中的应用：系统性文献综述

专知会员服务

17+阅读 · 2024年11月22日

半监督目标检测：从卷积神经网络（CNN）到 Transformer 的进展综述

半监督目标检测：从卷积神经网络（CNN）到 Transformer 的进展综述

专知会员服务

41+阅读 · 2024年7月12日

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

专知会员服务

108+阅读 · 2020年10月9日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

热门VIP内容

开通专知VIP会员享更多权益服务

《北约联合仿真与集成、验证与鉴定服务标准》2025最新40页

《面向协同任务的无人地面车辆与无人机（UGV-UAV）集成研究综述》2025最新综述论文

《理解大语言模型在军事战术任务规划中的局限性》

《国防与安全会议论文集》最新80页

相关资讯

AAAI 2022 | ProtGNN：自解释图神经网络

AAAI 2022 | ProtGNN：自解释图神经网络

专知

10+阅读 · 2022年2月28日

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

最新最全《深度元学习》2021综述论文，68页pdf，A Survey of Deep Meta-Learning

专知

11+阅读 · 2021年4月23日

【KDD2020-Tutorial】深度学习异常检测，180页ppt

【KDD2020-Tutorial】深度学习异常检测，180页ppt

专知

49+阅读 · 2020年8月28日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知

15+阅读 · 2020年7月23日

LibRec 每周算法：LDA主题模型

LibRec 每周算法：LDA主题模型

LibRec智能推荐

29+阅读 · 2017年12月4日

相关论文

LLM-Driven Feature-Level Adversarial Attacks on Android Malware Detectors

Arxiv

0+阅读 · 12月24日

OW-Rep: Open World Object Detection with Instance Representation Learning

Arxiv

0+阅读 · 12月21日

Self-Supervised Learning of Graph Representations for Network Intrusion Detection

Arxiv

0+阅读 · 12月20日

Cyber Risk Scoring with QUBO: A Quantum and Hybrid Benchmark Study

Arxiv

0+阅读 · 12月20日

Methods and Tools for Secure Quantum Clouds with a specific Case Study on Homomorphic Encryption

Arxiv

0+阅读 · 12月19日

相关基金

软件定义网络（SDN）环境下基于机器学习的路由预规划研究

国家自然科学基金

4+阅读 · 2015年12月31日

抗密钥篡改可证明安全公钥密码算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向Bug报告的软件故障重现方法研究

国家自然科学基金

4+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

基于安全需求分析的内核保护方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

微信扫码咨询专知VIP会员