即时阶段在与革命神经网络发现面孔方面的重要性 (The Importance of the Instantaneous Phase in Detecting Faces with Convolutional Neural Networks)

Convolutional Neural Networks (CNN) have provided new and accurate methods for processing digital images and videos. Yet, training CNNs is extremely demanding in terms of computational resources. Also, for specific applications, the standard use of transfer learning also tends to require far more resources than what may be needed. Furthermore, the final systems tend to operate as black boxes that are difficult to interpret. The current thesis considers the problem of detecting faces from the AOLME video dataset. The AOLME dataset consists of a large video collection of group interactions that are recorded in unconstrained classroom environments. For the thesis, still image frames were extracted at every minute from 18 24-minute videos. Then, each video frame was divided into 9x5 blocks with 50x50 pixels each. For each of the 19440 blocks, the percentage of face pixels was set as ground truth. Face detection was then defined as a regression problem for determining the face pixel percentage for each block. For testing different methods, 12 videos were used for training and validation. The remaining 6 videos were used for testing. The thesis examines the impact of using the instantaneous phase for the AOLME block-based face detection application. For comparison, the thesis compares the use of the Frequency Modulation image based on the instantaneous phase, the use of the instantaneous amplitude, and the original gray scale image. To generate the FM and AM inputs, the thesis uses dominant component analysis that aims to decrease the training overhead while maintaining interpretability.

翻译：连锁神经网络(CNN)为处理数字图像和视频提供了新的和准确的方法。然而,培训CNN在计算资源方面要求极高。此外,对于具体的应用程序,传输学习的标准使用也往往需要比可能需要的资源多得多。此外,最后系统往往作为难以解释的黑盒运作。当前的论文认为从AOLME视频数据集中探测面部的问题。AOLME数据集包含大量视频集成的团体互动,这些互动记录在不受限制的课堂环境中。对于论文而言,从18个24分钟的视频中每分钟提取一次图像框。然后,每个视频框被分为9x5个区块,每个区50x50像素。对于19440个区中的每一块,脸部象素的百分比被设定为难以解释的黑盒。然后,面对面检测被定义为确定每个区面像素百分比的回归问题。为了测试不同的方法,使用了12个视频来进行培训和校验。其余的6个视频用于测试。对于利用瞬时的图像分析,利用瞬间分析阶段的磁带分析,对磁带的图像进行了分析,同时使用磁带分析,对磁带分析,对磁段进行磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁带分析,对磁段分析,对磁到磁段分析,对磁段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段段进行。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日