Host based Intrusion Detection System (HIDS) is an effective last line of defense for defending against cyber security attacks after perimeter defenses (e.g., Network based Intrusion Detection System and Firewall) have failed or been bypassed. HIDS is widely adopted in the industry as HIDS is ranked among the top two most used security tools by Security Operation Centers (SOC) of organizations. Although effective and efficient HIDS is highly desirable for industrial organizations, the evolution of increasingly complex attack patterns causes several challenges resulting in performance degradation of HIDS (e.g., high false alert rate creating alert fatigue for SOC staff). Since Natural Language Processing (NLP) methods are better suited for identifying complex attack patterns, an increasing number of HIDS are leveraging the advances in NLP that have shown effective and efficient performance in precisely detecting low footprint, zero day attacks and predicting the next steps of attackers. This active research trend of using NLP in HIDS demands a synthesized and comprehensive body of knowledge of NLP based HIDS. Thus, we conducted a systematic review of the literature on the end to end pipeline of the use of NLP in HIDS development. For the end to end NLP based HIDS development pipeline, we identify, taxonomically categorize and systematically compare the state of the art of NLP methods usage in HIDS, attacks detected by these NLP methods, datasets and evaluation metrics which are used to evaluate the NLP based HIDS. We highlight the relevant prevalent practices, considerations, advantages and limitations to support the HIDS developers. We also outline the future research directions for the NLP based HIDS development.
翻译:以主机为主的入侵探测系统(HIDS)是防御周边防御失败或被绕过后发生的网络安全攻击袭击的有效最后防线。该行业广泛采用HIDS,因为HIDS是各组织安全行动中心(SOC)最常用的两种安全工具之一。虽然对工业组织来说,高效力和高效率的HIDS是高度可取的,但日益复杂的袭击模式的演变导致HIDS业绩退化的若干挑战(例如,高假警报率使SOC工作人员产生警报疲劳)。由于自然语言处理(NLP)方法更适合识别复杂的攻击模式,越来越多的HIDS正在利用NLP的进展,显示在精确探测低足迹、零日袭击和预测攻击者下一步方面,有效力和高效率的表现。在HIDS使用NLP NLP的动态, 以HID的高级警报率、 NIDS的常规研究方法,以及HDL的常规研究方法,在HDS的文献中系统审查HDR 最终使用。