Gender-based crime is one of the most concerning scourges of contemporary society. Governments worldwide have invested lots of economic and human resources to radically eliminate this threat. Despite these efforts, providing accurate predictions of the risk that a victim of gender violence has of being attacked again is still a very hard open problem. The development of new methods for issuing accurate, fair and quick predictions would allow police forces to select the most appropriate measures to prevent recidivism. In this work, we propose to apply Machine Learning (ML) techniques to create models that accurately predict the recidivism risk of a gender-violence offender. The relevance of the contribution of this work is threefold: (i) the proposed ML method outperforms the preexisting risk assessment algorithm based on classical statistical techniques, (ii) the study has been conducted through an official specific-purpose database with more than 40,000 reports of gender violence, and (iii) two new quality measures are proposed for assessing the effective police protection that a model supplies and the overload in the invested resources that it generates. Additionally, we propose a hybrid model that combines the statistical prediction methods with the ML method, permitting authorities to implement a smooth transition from the preexisting model to the ML-based model. This hybrid nature enables a decision-making process to optimally balance between the efficiency of the police system and aggressiveness of the protection measures taken.
翻译:以性别为基础的犯罪是当代社会的祸害之一。全世界各国政府都投入了大量的经济和人力资源来从根本上消除这一威胁。尽管作出了这些努力,但准确预测性别暴力受害者再次受到攻击的风险仍然是一个非常棘手的未决问题。制定发布准确、公平和快速预测的新方法,将使警察部队能够选择最适当的措施来防止再次犯罪。在这项工作中,我们提议采用机器学习技术来创建模型,准确预测性别暴力罪犯的累犯风险。这项工作的实用性有三个方面:(一) 拟议的ML方法超越了基于古典统计技术的先前存在的风险评估算法;(二) 研究是通过一个官方特定目的数据库进行的,该数据库有40 000多份性别暴力报告;(三) 提出了两项新的质量措施,以评估有效的警察保护,即示范用品和所产生投资资源的超负荷。此外,我们提出一种混合模式,将统计预测方法与ML方法结合起来,使当局能够从原有的效率模式顺利过渡到最佳的警察保护模式,使当局能够顺利地过渡到最佳效率模式。