Botnet detectors based on machine learning are potential targets for adversarial evasion attacks. Several research works employ adversarial training with samples generated from generative adversarial nets (GANs) to make the botnet detectors adept at recognising adversarial evasions. However, the synthetic evasions may not follow the original semantics of the input samples. This paper proposes a novel GAN model leveraged with deep reinforcement learning (DRL) to explore semantic aware samples and simultaneously harden its detection. A DRL agent is used to attack the discriminator of the GAN that acts as a botnet detector. The discriminator is trained on the crafted perturbations by the agent during the GAN training, which helps the GAN generator converge earlier than the case without DRL. We name this model RELEVAGAN, i.e. ["relive a GAN" or deep REinforcement Learning-based Evasion Generative Adversarial Network] because, with the help of DRL, it minimises the GAN's job by letting its generator explore the evasion samples within the semantic limits. During the GAN training, the attacks are conducted to adjust the discriminator weights for learning crafted perturbations by the agent. RELEVAGAN does not require adversarial training for the ML classifiers since it can act as an adversarial semantic-aware botnet detection model. Code will be available at https://github.com/rhr407/RELEVAGAN.
翻译:以机器学习为基础的植物检测器是对抗性规避攻击的潜在目标。 数项研究工作采用对抗性对抗性对抗网(GANs)产生的样本进行对抗性培训,使肉网检测器能够识别对抗性规避。 但是,合成逃逸可能无法遵循输入样本的原始语义学。 本文提出一种新型的GAN模型,利用深强化学习(DRL)来探索具有语义意识的样本,同时使其检测更加严格。 DRL代理器被用来攻击GAN的导师,该导师充当了肉网探测器。 该导师在GAN培训期间接受了由该剂制造者制造的触摸性测试,帮助GAN生成器比没有DRL(DL)在案件之前聚集起来。 我们把这个模型命名为RELEVAGAN,即[“再活一个GAN”或深REInstrugment Study Evasion Evasion Generation Adversarial 网络, 因为在DLLAVAL的帮助下, 将GAN的工作最小化为GAN的工作,让其发电机在Surber 样本中探索在Surbreal 内进行。 GRANSLANS tristr 的样本测试中进行。 在GRALANSL训练后, 需要进行一次测试后, 训练后, 需要不断在研修修修修修修修修。