In human-computer interaction or sign language interpretation, recognizing hand gestures and detecting fingertips become ubiquitous in computer vision research. In this paper, a unified approach of convolutional neural network for both hand gesture recognition and fingertip detection is introduced. The proposed algorithm uses a single network to predict the probabilities of finger class and positions of fingertips in one forward propagation of the network. Instead of directly regressing the positions of fingertips from the fully connected layer, the ensemble of the position of fingertips is regressed from the fully convolutional network. Subsequently, the ensemble average is taken to regress the final position of fingertips. Since the whole pipeline uses a single network, it is significantly fast in computation. The proposed method results in remarkably less pixel error as compared to that in the direct regression approach and it outperforms the existing fingertip detection approaches including the Heatmap-based framework.
翻译:在人体计算机互动或手语翻译中,识别手势和发现指尖在计算机视觉研究中变得无处不在。在本文件中,采用了进化神经网络的统一方法,既用于手势识别,又用于指尖检测。提议的算法使用单一网络来预测手指类的概率和指尖在网络前方传播中的位置。与直接回归方法相比,拟议方法的结果比直接回归法中的指尖错误要少得多,而且它比现有的指尖检测方法(包括Heatmap框架)要快得多,它比现有的指尖检测方法(包括Heatmap框架)要快得多。