The area of face recognition is one of the most widely researched areas in the domain of computer vision and biometric. This is because, the non-intrusive nature of face biometric makes it comparatively more suitable for application in area of surveillance at public places such as airports. The application of primitive methods in face recognition could not give very satisfactory performance. However, with the advent of machine and deep learning methods and their application in face recognition, several major breakthroughs were obtained. The use of 2D Convolution Neural networks(2D CNN) in face recognition crossed the human face recognition accuracy and reached to 99%. Still, robust face recognition in the presence of real world conditions such as variation in resolution, illumination and pose is a major challenge for researchers in face recognition. In this work, we used video as input to the 3D CNN architectures for capturing both spatial and time domain information from the video for face recognition in real world environment. For the purpose of experimentation, we have developed our own video dataset called CVBL video dataset. The use of 3D CNN for face recognition in videos shows promising results with DenseNets performing the best with an accuracy of 97% on CVBL dataset.
翻译:面部识别领域是计算机视觉和生物鉴别学领域最广泛研究的领域之一,这是因为面部生物鉴别学的非侵入性使面部生物鉴别学相对更适合在公共场所如机场的监视领域应用。在面部识别方面,应用原始方法并不能带来非常令人满意的表现。然而,随着机器和深层学习方法的出现及其在面部识别方面的应用,取得了一些重大突破。在面部识别方面使用2D演动神经网络(2D CNN)跨越了人面部识别精确度,达到99%。然而,在真实的世界条件下,如分辨率、照明度和姿势的变化,面部识别能力较强,这是研究人员面对的重大挑战。在这项工作中,我们利用视频作为3DCNN系统架构的输入,从视频中获取空间和时间域域信息,以便在真实世界环境中的面识别。为了实验的目的,我们开发了自己的视频数据集,称为CVBL视频数据集。在视频中进行面部识别时,3DCNN显示DNet最有希望的结果,DenseNet在CVB数据集上以97%的准确度表现最佳。