As the societal impact of Deep Neural Networks (DNNs) grows, the goals for advancing DNNs become more complex and diverse, ranging from improving a conventional model accuracy metric to infusing advanced human virtues such as fairness, accountability, transparency (FaccT), and unbiasedness. Recently, techniques in Explainable Artificial Intelligence (XAI) are attracting considerable attention, and have tremendously helped Machine Learning (ML) engineers in understanding AI models. However, at the same time, we started to witness the emerging need beyond XAI among AI communities; based on the insights learned from XAI, how can we better empower ML engineers in steering their DNNs so that the model's reasonableness and performance can be improved as intended? This article provides a timely and extensive literature overview of the field Explanation-Guided Learning (EGL), a domain of techniques that steer the DNNs' reasoning process by adding regularization, supervision, or intervention on model explanations. In doing so, we first provide a formal definition of EGL and its general learning paradigm. Secondly, an overview of the key factors for EGL evaluation, as well as summarization and categorization of existing evaluation procedures and metrics for EGL are provided. Finally, the current and potential future application areas and directions of EGL are discussed, and an extensive experimental study is presented aiming at providing comprehensive comparative studies among existing EGL models in various popular application domains, such as Computer Vision (CV) and Natural Language Processing (NLP) domains.
翻译:随着深神经网络(DNNS)的社会影响不断增强,推进DNNS的目标变得更加复杂和多样化,从改进常规模型准确度标准到传播公平、问责、透明、公正等高级人类美德(如公平、问责、透明、公正)等常规模型,从改善常规模型准确度指标到推广高级人类美德(FaccT),最近,可解释的人工智能(XAI)技术吸引了相当多的关注,极大地帮助了机器学习工程师理解AI模型。然而,与此同时,我们开始目睹AI社区在XAI之外新出现的需求;根据XAI的见解,我们如何能够更好地赋予ML工程师指导其DNNS的能力,以便按照预期改进模型的合理性和性及性能? 本文对解释性指导DNNational 智能(XAII)过程的技术领域进行了及时和广泛的文献概览,通过增加规范、监督或干预模型解释,指导DNNM(ML)过程。 与此同时,我们首先对EGL及其一般学习模式作了正式的大众定义。 其次,关于EG评估的关键因素的概述,以及目前应用领域中的计算机化和潜在评价领域,作为GEGAG的最后和分类,提供了现有的指标和分类,提供了现有评价领域,作为EGAGAGA的当前评价领域和潜在评价领域,作为最后的参考和分类和分类和分类。