Algorithmic interpretability is necessary to build trust, ensure fairness, and track accountability. However, there is no existing formal measurement method for algorithmic interpretability. In this work, we build upon programming language theory and cognitive load theory to develop a framework for measuring algorithmic interpretability. The proposed measurement framework reflects the process of a human learning an algorithm. We show that the measurement framework and the resulting cognitive complexity score have the following desirable properties - universality, computability, uniqueness, and monotonicity. We illustrate the measurement framework through a toy example, describe the framework and its conceptual underpinnings, and demonstrate the benefits of the framework, in particular for managers considering tradeoffs when selecting algorithms.
翻译:在这项工作中,我们以编程语言理论和认知负载理论为基础,制定一个衡量算法解释的框架。拟议计量框架反映了人类学习算法的过程。我们证明测量框架和由此产生的认知复杂性分数具有下列可取的特性:普遍性、可比较性、独一性和单一性。我们通过一个微小的例子来说明计量框架,描述框架及其概念基础,并展示框架的效益,特别是管理人员在选择算法时考虑权衡的效益。