Artificial Intelligence (AI) software systems, such as Sentiment Analysis (SA) systems, typically learn from large amounts of data that may reflect human biases. Consequently, the machine learning model in such software systems may exhibit unintended demographic bias based on specific characteristics (e.g., gender, occupation, country-of-origin, etc.). Such biases manifest in an SA system when it predicts a different sentiment for similar texts that differ only in the characteristic of individuals described. Existing studies on revealing bias in SA systems rely on the production of sentences from a small set of short, predefined templates. To address this limitation, we present BisaFinder, an approach to discover biased predictions in SA systems via metamorphic testing. A key feature of BisaFinder is the automatic curation of suitable templates based on the pieces of text from a large corpus, using various Natural Language Processing (NLP) techniques to identify words that describe demographic characteristics. Next, BisaFinder instantiates new text from these templates by filling in placeholders with words associated with a class of a characteristic (e.g., gender-specific words such as female names, "she", "her"). These texts are used to tease out bias in an SA system. BisaFinder identifies a bias-uncovering test case when it detects that the SA system exhibits demographic bias for a pair of texts, i.e., it predicts a different sentiment for texts that differ only in words associated with a different class (e.g., male vs. female) of a target characteristic (e.g., gender). Our empirical evaluation showed that BiasFinder can effectively create a larger number of fluent and diverse test cases that uncover various biases in an SA system.
翻译:人工智能(AI)软件系统,如感官分析(SA)系统,通常从大量可能反映人类偏见的数据中学习,因此,这种软件系统中的机器学习模式可能根据具体特点(例如性别、职业、原籍国等)出现意想不到的人口偏差。这种偏差表现在一种SA系统中,因为该系统预测对类似文本有不同情绪,而这种情绪仅与所描述的个人特征不同。关于揭示SA系统中的偏差的现有研究依赖于从一套小型的简短、预定义模板中生成的句子。为了应对这一限制,我们介绍了BisaFinder,这是一种通过变形测试在SA系统中发现有偏差预测的方法。BisaFinder是一个关键特征的自动校正,它使用各种自然语言处理(NLP)技术来识别描述人口特征的词。其次是BisaFiread(例如性别专用词,Bireadoral)系统里有性别偏差,“Siefireander”系统里有性别偏见。这些测试案例用来检测。