Big Data is reforming many industrial domains by providing decision support through analyzing large data volumes. Big Data testing aims to ensure that Big Data systems run smoothly and error-free while maintaining the performance and quality of data. However, because of the diversity and complexity of data, testing Big Data is challenging. Though numerous research efforts deal with Big Data testing, a comprehensive review to address testing techniques and challenges of Big Data is not available as yet. Therefore, we have systematically reviewed the Big Data testing techniques evidence occurring in the period 2010-2021. This paper discusses testing data processing by highlighting the techniques used in every processing phase. Furthermore, we discuss the challenges and future directions. Our findings show that diverse functional, non-functional and combined (functional and non-functional) testing techniques have been used to solve specific problems related to Big Data. At the same time, most of the testing challenges have been faced during the MapReduce validation phase. In addition, the combinatorial testing technique is one of the most applied techniques in combination with other techniques (i.e., random testing, mutation testing, input space partitioning and equivalence testing) to find various functional faults through Big Data testing.
翻译:大数据测试旨在确保大数据系统在保持数据性能和质量的同时顺利、无误地运行。然而,由于数据的多样性和复杂性,测试大数据具有挑战性。尽管许多研究工作涉及大数据测试,但目前还没有针对大数据测试进行全面审查,以解决大数据测试技术和挑战。因此,我们系统地审查了2010至2021年期间发生的大数据测试技术证据。本文通过突出每个处理阶段所使用的技术来讨论测试数据处理。此外,我们讨论了挑战和未来的方向。我们的调查结果显示,使用各种功能性、非功能性和综合(功能性和非功能性)测试技术来解决与大数据有关的具体问题。与此同时,大多数测试挑战在MapReduce验证阶段已经面临。此外,组合测试技术是与其他技术(即随机测试、突变测试、输入空间分割和对等测试)相结合的最应用技术之一,以便通过大数据测试找出各种功能缺陷。