In this paper, we propose a multi-task representation learning framework to jointly estimate the identity, gender and age of individuals from their hand images for the purpose of criminal investigations since the hand images are often the only available information in cases of serious crime such as sexual abuse. We investigate different up-to-date deep learning architectures and compare their performance for joint estimation of identity, gender and age from hand images of perpetrators of serious crime. To overcome the data imbalance and simplify the age prediction, we create age groups for the age estimation. We make extensive evaluations and comparisons of both convolution-based and transformer-based deep learning architectures on a publicly available 11k hands dataset. Our experimental analysis shows that it is possible to efficiently estimate not only identity but also other attributes such as gender and age of suspects jointly from hand images for criminal investigations, which is crucial in assisting international police forces in the court to identify and convict abusers.
翻译:在本文中,我们提出了一种多任务表示学习框架,用于从手部图像中联合估计个体的身份、性别和年龄,以便进行犯罪调查,因为手部图像通常是严重犯罪(如性虐待)案件中唯一可用的信息。我们调查了不同的最新深度学习架构,并比较了它们在联合估计犯罪嫌疑人手部图像的身份、性别和年龄方面的表现。为了克服数据不平衡和简化年龄预测,我们为年龄估计创建了年龄组。我们在公开的11k手部数据集上进行了广泛的评估和比较卷积和transformer两种深度学习体系结构的表现。我们的实验分析表明,从手部图像联合估计嫌疑人的个体身份、性别和年龄是可行的,而这对于协助国际警方在法庭上识别和定罪施虐者至关重要。