Spreadsheets are among the most widely used programming environments and are heavily deployed in domains such as finance, where errors can have catastrophic consequences. We present a static analysis designed specifically to find spreadsheet formula errors. The analysis directly exploits the rectangular character of spreadsheets: it uses an information-theoretic approach to identify formulas that are especially surprising disruptions to nearby rectangular regions. We present ExceLint, an implementation of this analysis for Microsoft Excel. We demonstrate that ExceLint is fast and effective: across a corpus of 70 spreadsheets, it takes a median of 5 seconds per spreadsheet and significantly outperforms the state-of-the-art analysis.
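To make the information-theoretic idea concrete, the following is a minimal sketch and not ExceLint's actual algorithm. It assumes each formula in a region has already been summarized as a hypothetical "fingerprint" (here, the row and column offset of the cell it references), measures the Shannon entropy of the fingerprints in the region, and reports the cell whose replacement by the region's most common fingerprint would lower that entropy the most. The names `entropy`, `most_surprising`, and `column` are illustrative assumptions.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of the empirical distribution of labels."""
    counts = Counter(labels)
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def most_surprising(fingerprints):
    """Return (index, entropy_drop) for the cell whose replacement by the
    region's most common fingerprint reduces entropy the most.

    Returns (None, 0.0) when every cell already shares one fingerprint.
    """
    base = entropy(fingerprints)
    majority, _ = Counter(fingerprints).most_common(1)[0]
    best_index, best_drop = None, 0.0
    for i, fp in enumerate(fingerprints):
        if fp == majority:
            continue  # cells that already match the region are not suspects
        proposed = fingerprints[:i] + [majority] + fingerprints[i + 1:]
        drop = base - entropy(proposed)
        if drop > best_drop:
            best_index, best_drop = i, drop
    return best_index, best_drop


# Hypothetical column of formulas: each tuple is the (row, column) offset
# of the referenced cell. One formula points two columns left, not one.
column = [(0, -1), (0, -1), (0, -1), (0, -2), (0, -1), (0, -1)]
index, drop = most_surprising(column)
```

In this toy column the fourth formula is the only one that breaks the pattern, so it is the candidate a reviewer would be asked to inspect. A real analysis would also need to handle two-dimensional regions and decide which disruptions are intentional.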