The use of 3D cameras for gait analysis has been highly questioned due to the low accuracy they have demonstrated in the past. The objective of the study presented in this paper is to improve the accuracy of the estimations made by robot-mounted 3D cameras in human gait analysis by applying a supervised learning stage. The 3D camera was mounted in a mobile robot to obtain a longer walking distance. This study shows an improvement in detection of kinematic gait signals and gait descriptors by post-processing the raw estimations of the camera using artificial neural networks trained with the data obtained from a certified Vicon system. To achieve this, 37 healthy participants were recruited and data of 207 gait sequences were collected using an Orbbec Astra 3D camera. There are two basic possible approaches for training: using kinematic gait signals and using gait descriptors. The former seeks to improve the waveforms of kinematic gait signals by reducing the error and increasing the correlation with respect to the Vicon system. The second is a more direct approach, focusing on training the artificial neural networks using gait descriptors directly. The accuracy of the 3D camera was measured before and after training. In both training approaches, an improvement was observed. Kinematic gait signals showed lower errors and higher correlations with respect to the ground truth. The accuracy of the system to detect gait descriptors also showed a substantial improvement, mostly for kinematic descriptors rather than spatio-temporal. When comparing both training approaches, it was not possible to define which was the absolute best. Therefore, we believe that the selection of the training approach will depend on the purpose of the study to be conducted. This study reveals the great potential of 3D cameras and encourages the research community to continue exploring their use in gait analysis.
翻译:使用三维摄像机进行动作分析,由于过去显示的精确度很低,因此受到高度质疑。本文件所述研究的目的是通过应用受监督的学习阶段,提高在人行动作分析中使用机器人上三维摄像机所作的估计的准确性。三维相机安装在一个移动机器人中,以获得更远的步行距离。这项研究表明,在用经认证的维昆系统的数据培训的人工神经网络对摄像机进行原始估计后,对动动动的动作信号和音标的检测有所改进。为此,37名健康参与者被聘用,用Orbbec Astra 3D相机收集了207个音响序列的数据。有两个可能的基本培训方法:使用运动式的音征信号,并使用了更远的音标。通过减少错误和增加与维登系统的关联性来改进运动的波形信号。第二个方法是更直接地界定方法,重点是利用声标描述的音标网络培训,而不是直接地进行更精确的音标。在进行深度培训之前,对三维德的图像进行了精确性分析。在进行深度培训之后,还进行了测量了对底线的精确性研究。在测量后,对调的精确性研究显示,对底校的精确性研究显示,然后显示的精确性研究显示,对底色的精确性研究是显示的精确性研究。在进行了。