Cyber-crimes have become a multi-billion-dollar industry in the recent years. Most cybercrimes/attacks involve deploying some type of malware. Malware that viciously targets every industry, every sector, every enterprise and even individuals has shown its capabilities to take entire business organizations offline and cause significant financial damage in billions of dollars annually. Malware authors are constantly evolving in their attack strategies and sophistication and are developing malware that is difficult to detect and can lay dormant in the background for quite some time in order to evade security controls. Given the above argument, Traditional approaches to malware detection are no longer effective. As a result, deep learning models have become an emerging trend to detect and classify malware. This paper proposes a new convolutional deep learning neural network to accurately and effectively detect malware with high precision. This paper is different than most other papers in the literature in that it uses an expert data science approach by developing a convolutional neural network from scratch to establish a baseline of the performance model first, explores and implements an improvement model from the baseline model, and finally it evaluates the performance of the final model. The baseline model initially achieves 98% accurate rate but after increasing the depth of the CNN model, its accuracy reaches 99.183 which outperforms most of the CNN models in the literature. Finally, to further solidify the effectiveness of this CNN model, we use the improved model to make predictions on new malware samples within our dataset.
翻译:近年来,网上犯罪已成为一个数十亿美元的行业。大多数网络犯罪/攻击都涉及部署某种恶意软件。恶意地针对每个行业、每个行业、每个部门、每个企业、甚至个人,恶意地针对每个行业、每个部门、每个企业、甚至个人,恶意地针对每个行业、每个部门、每个企业、每个企业、甚至个人,表明其有能力将整个商业组织脱机,每年造成数十亿美元的重大财务损失。恶意作者在攻击战略和复杂程度方面不断演进,并正在开发难以探测的恶意软件,并且可以在相当长一段时间内躲藏在背后,以逃避安全控制。根据上述论点,传统的恶意软件检测方法不再有效。因此,深层次的学习模式已成为发现和分类恶意软件的新趋势。本文提出了一个新的革命性深层学习神经网络,以便准确和有效地检测恶意软件。本文与文献中的其他论文不同,即它使用专家数据科学方法,首先从零到建立性能模型的基线,探索和采用改进模型模型,最后评估最后模型的性能。基线模型最初达到98-183的准确率,但最后又增加了CNIS的准确性模型的精确度。