The ongoing 'digital transformation' fundamentally changes audit evidence's nature, recording, and volume. Nowadays, the International Standards on Auditing (ISA) requires auditors to examine vast volumes of a financial statement's underlying digital accounting records. As a result, audit firms also 'digitize' their analytical capabilities and invest in Deep Learning (DL), a successful sub-discipline of Machine Learning. The application of DL offers the ability to learn specialized audit models from data of multiple clients, e.g., organizations operating in the same industry or jurisdiction. In general, regulations require auditors to adhere to strict data confidentiality measures. At the same time, recent intriguing discoveries showed that large-scale DL models are vulnerable to leaking sensitive training data information. Today, it often remains unclear how audit firms can apply DL models while complying with data protection regulations. In this work, we propose a Federated Learning framework to train DL models on auditing relevant accounting data of multiple clients. The framework encompasses Differential Privacy and Split Learning capabilities to mitigate data confidentiality risks at model inference. We evaluate our approach to detect accounting anomalies in three real-world datasets of city payments. Our results provide empirical evidence that auditors can benefit from DL models that accumulate knowledge from multiple sources of proprietary client data.
翻译:目前,国际审计准则(ISA)要求审计员审查大量财务报表的基本数字会计记录,因此,审计公司还“数字化”其分析能力,并投资于深度学习(DL),这是机械学习的一个成功的次级纪律。DL的应用提供了从多个客户的数据中学习专门审计模式的能力,例如,在同一行业或管辖范围内运作的组织的数据。一般而言,条例要求审计员遵守严格的数据保密措施。与此同时,最近令人感兴趣的发现显示,大型DL模型很容易泄露敏感的培训数据信息。如今,审计公司常常不清楚在遵守数据保护条例的同时如何应用DL模型。在这项工作中,我们提议一个联邦学习框架,以培训DL模型,用于审计多个客户的相关会计数据。框架包括不同隐私和分散学习能力,以降低模型推导的数据保密风险。我们评价了我们从三个真实世界数据模型中发现三个城市支付数据异常现象的方法。我们的经验审计员们积累了数据,从而从多种客户支付来源获得的经验证据。