Security issues have gradually emerged with the continuous development of artificial intelligence (AI). Earlier work verified the possibility of converting neural network models into stegomalware, embedding malware into a model with limited impact on the model's performance. However, existing methods are not applicable in real-world attack scenarios and do not attract enough attention from the security community due to performance degradation and additional workload. Therefore, we propose an improved stegomalware EvilModel. By analyzing the composition of the neural network model, three new methods for embedding malware into the model are proposed: MSB reservation, fast substitution, and half substitution, which can embed malware that accounts for half of the model's volume without affecting the model's performance. We built 550 EvilModels using ten mainstream neural network models and 19 malware samples. The experiment shows that EvilModel achieved an embedding rate of 48.52\%. A quantitative algorithm is proposed to evaluate the existing embedding methods. We also design a trigger and propose a threat scenario for the targeted attack. The practicality and effectiveness of the proposed methods were demonstrated by experiments and analyses of the embedding capacity, performance impact, and detection evasion.
翻译:随着人工智能的不断发展(AI),逐渐出现了安全问题。早期的工作核实了将神经网络模型转换成stegomalware的可能性,将恶意软件嵌入对模型性能影响有限的模型;然而,由于性能退化和工作量增加,现有方法不适用于现实世界攻击情景,没有引起安全界足够重视。因此,我们建议改进Stegomalware WildModel。通过分析神经网络模型的构成,提出了将恶意软件嵌入模型的三种新方法:MSB保留、快速替代和半替代,这可以嵌入占模型性能一半的恶意软件,而不会影响模型性能。我们利用10个主流神经网络模型和19个恶意软件样本建造了550个邪恶模型。实验表明,邪恶Model达到了48.52 ⁇ 的嵌入率。我们提出了定量算法,以评价现有的嵌入方法。我们还设计了一个触发器,并提出了目标攻击的威胁情景。通过实验和分析嵌入能力、性影响、探测和规避,证明了拟议方法的实用性和有效性。