A myriad of recent literary works has leveraged generative adversarial networks (GANs) to generate unseen evasion samples. The purpose is to annex the generated data with the original train set for adversarial training to improve the detection performance of machine learning (ML) classifiers. The quality of generated adversarial samples relies on the adequacy of training data samples. However, in low data regimes like medical diagnostic imaging and cybersecurity, the anomaly samples are scarce in number. This paper proposes a novel GAN design called Evasion Generative Adversarial Network (EVAGAN) that is more suitable for low data regime problems that use oversampling for detection improvement of ML classifiers. EVAGAN not only can generate evasion samples, but its discriminator can act as an evasion-aware classifier. We have considered Auxiliary Classifier GAN (ACGAN) as a benchmark to evaluate the performance of EVAGAN on cybersecurity (ISCX-2014, CIC-2017 and CIC2018) botnet and computer vision (MNIST) datasets. We demonstrate that EVAGAN outperforms ACGAN for unbalanced datasets with respect to detection performance, training stability and time complexity. EVAGAN's generator quickly learns to generate the low sample class and hardens its discriminator simultaneously. In contrast to ML classifiers that require security hardening after being adversarially trained by GAN-generated data, EVAGAN renders it needless. The experimental analysis proves that EVAGAN is an efficient evasion hardened model for low data regimes for the selected cybersecurity and computer vision datasets. Code will be available at HTTPS://www.github.com/rhr407/EVAGAN.
翻译:最近大量文学作品利用了基因对抗网络(GANs)来生成逃避的样本。目的是将生成的数据与最初的对抗性训练列列列列列列,以提高机器学习(ML)分类者的检测性能。生成的对抗性抽样的质量取决于培训数据样本的充足性。然而,在医疗诊断成像和网络安全等低数据系统中,异常样本数量很少。本文件建议采用一种新型的GAN设计,称为Evasion General Adversarial 网络(EVAGAN),它更适合使用过度抽样来改进ML分类者的检测性能的低数据系统问题。EVAGAN不仅可以生成逃避性样,而且其导师也可以发挥逃避性能。我们把AAN(ACGAN)作为辅助性分类(ACAN)作为评估EVAGAN(IS-2014、CIC-2017和CIC2018)网络和计算机抽样系统(MNIST)的运行情况的基准。我们表明,EVAN(EVAN)超越了HTTGGGGGAN(HIL)的升级的硬性数据分析。在测试后,需要通过对ARCALAGA值进行硬性变现数据测试, 数据稳定和低性变码数据分析后,需要对低性变码数据进行。