Realistic benchmarks of reproducible bugs and fixes are vital to good experimental evaluation of debugging and testing approaches. However, there is no suitable benchmark suite that can systematically evaluate the debugging and testing methods of quantum programs until now. This paper proposes Bugs4Q, a benchmark of thirty-six real, manually validated Qiskit bugs from four popular Qiskit elements (Terra, Aer, Ignis, and Aqua), supplemented with the test cases for reproducing buggy behaviors. Bugs4Q also provides interfaces for accessing the buggy and fixed versions of the Qiskit programs and executing the corresponding test cases, facilitating the reproducible empirical studies and comparisons of Qiskit program debugging and testing tools. Bugs4Q is publicly available at https://github.com/Z-928/Bugs4Q.
翻译:重现错误和修正的现实基准对于对调试和测试方法进行良好的实验性评估至关重要,然而,到目前为止,还没有合适的基准套件能够系统评估量子程序调试和测试方法。本文提议采用Bugs4Q,这是四个受欢迎的基斯基特元素(Terra、Aer、Ignis和Aqua)中36个真实、手工验证的基斯基特错误的基准,并辅之以重生错误行为的测试案例。Bugs4Q还提供接口,用于访问基斯基特程序错误和固定版本,执行相应的测试案例,便利可复制的经验研究和对基斯基特程序调试和测试工具的比较。Bugs4Q可在https://github.com/Z-928/Bugs4Q上公开查阅。