Stream processing has been an active research field for more than 20 years, but it is now witnessing its prime time due to recent successful efforts by the research community and numerous worldwide open-source communities. This survey provides a comprehensive overview of fundamental aspects of stream processing systems and their evolution in the functional areas of out-of-order data management, state management, fault tolerance, high availability, load management, elasticity, and reconfiguration. We review noteworthy past research findings, outline the similarities and differences between early ('00-'10) and modern ('11-'18) streaming systems, and discuss recent trends and open problems.
翻译:20多年来,溪流处理一直是一个积极的研究领域,但由于研究界和世界各地众多的开放源码社区最近的成功努力,现在它正经历着它的黄金时期。这项调查全面概述了溪流处理系统的基本方面及其在断层数据管理、国家管理、缺陷容忍度、高可用性、负载管理、弹性和重组等职能领域的演变。我们审查了以往值得注意的研究结果,概述了早期('00-'10)和现代('11-18')流流系统之间的异同,并讨论了最近的趋势和公开问题。