LSTM 用于恶意检测的超光谱选择:交互效应和等级选择方法 (LSTM Hyper-Parameter Selection for Malware Detection: Interaction Effects and Hierarchical Selection Approach)

Long-Short-Term-Memory (LSTM) networks have shown great promise in artificial intelligence (AI) based language modeling. Recently, LSTM networks have also become popular for designing AI-based Intrusion Detection Systems (IDS). However, its applicability in IDS is studied largely in the default settings as used in language models. Whereas security applications offer distinct conditions and hence warrant careful consideration while applying such recurrent networks. Therefore, we conducted one of the most exhaustive works on LSTM hyper-parameters for IDS and experimented with approx. 150 LSTM configurations to determine its hyper-parameters relative importance, interaction effects, and optimal selection approach for designing an IDS. We conducted multiple analyses of the results of these experiments and empirically controlled for the interaction effects of different hyper-parameters covariate levels. We found that for security applications, especially for designing an IDS, neither similar relative importance as applicable to language models is valid, nor is the standard linear method for hyper-parameter selection ideal. We ascertained that the interaction effect plays a crucial role in determining the relative importance of hyper-parameters. We also discovered that after controlling for the interaction effect, the correct relative importance for LSTMs for an IDS is batch-size, followed by dropout ratio and padding. The findings are significant because when LSTM was first used for language models, the focus had mostly been on increasing the number of layers to enhance performance.

翻译：长期短期计量(LSTM)网络在人工智能(AI)基础语言模型方面显示出很大的希望。最近,LSTM网络在设计基于AI的入侵探测系统方面也变得很受欢迎。然而,在语言模型中使用的默认情况下,对它是否适用于IDS进行了研究。虽然安全应用提供了不同的条件,因此在应用这种经常性网络时值得仔细考虑。因此,我们为IDS进行了LSTM超参数最详尽的工作之一,并试验了约150 LSTM配置,以确定其超参数相对重要性、互动效应和设计IDS的最佳选择方法。我们对这些实验的结果进行了多次分析,并对不同超参数的相异水平的互动效果进行了经验性控制。我们发现,对于安全应用,特别是设计与语言模型无关的相对重要性,对于超参数模型选择理想而言,我们进行了最详尽的标准线性方法。我们确定,互动效应在决定超参数的相对重要性、互动效果和设计综合数据基础设施的最佳选择方法方面发挥着至关重要的作用。我们对这些实验的结果进行了多次分析,对不同超参数的相互作用的影响进行了经验性控制。我们发现,因为I在使用了高水平后,对不断提高的比比重的等级的等级的分辨率的比重影响是。