In this paper, we propose a multi-task representation learning framework to jointly estimate the identity, gender and age of individuals from their hand images for the purpose of criminal investigations since the hand images are often the only available information in cases of serious crime such as sexual abuse. We investigate different up-to-date deep learning architectures and compare their performance for joint estimation of identity, gender and age from hand images of perpetrators of serious crime. To overcome the data imbalance and simplify the age prediction, we create age groups for the age estimation. We make extensive evaluations and comparisons of both convolution-based and transformer-based deep learning architectures on a publicly available 11k hands dataset. Our experimental analysis shows that it is possible to efficiently estimate not only identity but also other attributes such as gender and age of suspects jointly from hand images for criminal investigations, which is crucial in assisting international police forces in the court to identify and convict abusers.
翻译:在本文中,我们提出了一种多任务表示学习框架,将各个人的手部图像联合估计出其身份、性别和年龄,以协助刑事调查,因为在严重犯罪(例如性侵)的案件中,手部图像通常是唯一可用的信息。我们研究了不同的最新深度学习架构,并比较它们在联合从犯罪者的手部图像中预测身份、性别和年龄的性能。为了解决数据不平衡和简化年龄预测,我们为年龄估计创建了年龄组。我们在一个公开的11k hands数据集上对基于卷积和变压器的深度学习架构进行了广泛评估和比较。我们的实验分析表明,可以高效地从犯罪嫌疑人的手部图像联合预测身份、性别和年龄,这对于协助国际警方在法庭上识别并定罪施虐犯非常重要。