As herd size on dairy farms continues to increase, automatic health monitoring of cows is gaining in interest. Lameness, a prevalent health disorder in dairy cows, is commonly detected by analyzing the gait of cows. A cow's gait can be tracked in videos using pose estimation models because models learn to automatically localize anatomical landmarks in images and videos. Most animal pose estimation models are static, that is, videos are processed frame by frame and do not use any temporal information. In this work, a static deep-learning model for animal-pose-estimation was extended to a temporal model that includes information from past frames. We compared the performance of the static and temporal pose estimation models. The data consisted of 1059 samples of 4 consecutive frames extracted from videos (30 fps) of 30 different dairy cows walking through an outdoor passageway. As farm environments are prone to occlusions, we tested the robustness of the static and temporal models by adding artificial occlusions to the videos.The experiments showed that, on non-occluded data, both static and temporal approaches achieved a Percentage of Correct Keypoints (PCKh@0.2) of 99%. On occluded data, our temporal approach outperformed the static one by up to 32.9%, suggesting that using temporal data was beneficial for pose estimation in environments prone to occlusions, such as dairy farms. The generalization capabilities of the temporal model was evaluated by testing it on data containing unknown cows (cows not present in the training set). The results showed that the average PCKh@0.2 was of 93.8% on known cows and 87.6% on unknown cows, indicating that the model was capable of generalizing well to new cows and that they could be easily fine-tuned to new herds. Finally, we showed that with harder tasks, such as occlusions and unknown cows, a deeper architecture was more beneficial.
翻译:随着奶牛农场的牛群规模继续增加,牛群的自动健康监测正在增加,牛群的牛群规模正在增加。乳牛群中普遍存在的健康障碍是乳牛们通过分析奶牛的步态通常被检测到的,乳牛的行踪在视频中可以使用估测模型进行跟踪。由于模型学会在图像和视频中自动定位解剖标志。大多数动物的估测模型是静态的,即视频是按框架处理的,不使用任何时间信息。在这项工作中,一个静态的动物施用估计的深层学习模型被扩展为包括过去框架信息在内的时间模型。我们比较了静态和时间估计模型的性能。我们比较了静态和时间模型的性能。我们比较了静态和时间模型的性能模型,我们比较了静态基点(PCK@0 2) 和时间估测模型的性能,从30个奶牛群的连续4个框架样本中提取了10个样本。由于农业环境的变异性数据,我们比较了普通模型,因此可以测试了静态和时间模型。在普通模型上展示了更深层牛群体化的数据,我们展示了这个模型,因此显示了准确基点的固定和时间结构的数值,在精确值的模型中显示为精确值值值的模型显示为精确值值,在98值的模型是比值值为10比值的模型,在9 。在精确度上的数据是比数据是比为10比数据, 。