Computer vision enables the development of new approaches to monitor the behavior, health, and welfare of animals. Instance segmentation is a high-precision method in computer vision for detecting individual animals of interest. This method can be used for in-depth analysis of animals, such as examining their subtle interactive behaviors, from videos and images. However, existing deep-learning-based instance segmentation methods have been mostly developed based on public datasets, which largely omit heavy occlusion problems; therefore, these methods have limitations in real-world applications involving object occlusions, such as farrowing pen systems used on pig farms in which the farrowing crates often impede the sow and piglets. In this paper, we adapt a Center Clustering Network originally designed for counting to achieve instance segmentation, dubbed as CClusnet-Inseg. Specifically, CClusnet-Inseg uses each pixel to predict object centers and trace these centers to form masks based on clustering results, which consists of a network for segmentation and center offset vector map, Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm, Centers-to-Mask (C2M), and Remain-Centers-to-Mask (RC2M) algorithms. In all, 4,600 images were extracted from six videos collected from three closed and three half-open farrowing crates to train and validate our method. CClusnet-Inseg achieves a mean average precision (mAP) of 84.1 and outperforms all other methods compared in this study. We conduct comprehensive ablation studies to demonstrate the advantages and effectiveness of core modules of our method. In addition, we apply CClusnet-Inseg to multi-object tracking for animal monitoring, and the predicted object center that is a conjunct output could serve as an occlusion-resistant representation of the location of an object.
翻译:计算机视觉可以开发新的方法来监测动物的行为、健康和福利。 在计算机视觉中, 时间分割是一种高精确度的计算机视觉方法, 用来检测感兴趣的动物。 这种方法可用于深入分析动物, 例如通过视频和图像来检查它们的微妙互动行为。 但是, 现有的深学习型实例分割方法大多是在公共数据集的基础上开发的, 这些数据集基本上省略了严重的封闭性问题; 因此, 这些方法在现实世界应用中, 涉及到物体隐蔽性的应用程序中存在局限性, 例如在猪养殖场中, 拖动式纸箱常常阻碍各个感兴趣的动物的精确度。 在本文中, 我们设计用于计算其细小互动行为的中心集群网络网络网络网络网络网络, 利用我们内部的每个像素来预测对象中心, 并跟踪这些中心形成基于集成结果的遮掩码, 包括用于分解和中心比较矢量图的网络, 硬度- 基于空间天线的天体组合目标, 用于应用的半透明性结构, 用于诺伊( DBS- AS- ) 的内、 直路路路路段和内部变变的图像分析中心。