We propose MALIGN, a novel malware family detection approach inspired by genome sequence alignment. MALIGN encodes malware using four nucleotides and then uses genome sequence alignment approaches to create a signature of a malware family based on the code fragments conserved in the family making it robust to evasion by modification and addition of content. Moreover, unlike previous approaches based on sequence alignment, our method uses a multiple whole-genome alignment tool that protects against adversarial attacks such as code insertion, deletion or modification. Our approach outperforms state-of-the-art machine learning based malware detectors and demonstrates robustness against trivial adversarial attacks. MALIGN also helps identify the techniques malware authors use to evade detection.
翻译:我们建议使用新颖的恶意软件家庭检测法,即基因组序列校正法。MALIGN使用四种核素编码恶意软件,然后使用基因组序列校正法生成一个以家庭保存的代码碎片为基础的恶意软件家庭的签名,使其通过修改和添加内容而强大地规避。此外,与以往的序列校正法不同,我们的方法使用多种全基因校对法,防止对抗性攻击,如代码插入、删除或修改。我们的方法优于最先进的机器学习恶意软件探测器,并展示了抵御轻微对抗性攻击的稳健性。MALIGN还帮助识别了恶意软件作者用来逃避检测的技术。