The ability to capture detailed interactions among individuals in a social group is foundational to our study of animal behavior and neuroscience. Recent advances in deep learning and computer vision are driving rapid progress in methods that can record the actions and interactions of multiple individuals simultaneously. Many social species, such as birds, however, live deeply embedded in a three-dimensional world. This world introduces additional perceptual challenges such as occlusions, orientation-dependent appearance, large variation in apparent size, and poor sensor coverage for 3D reconstruction, that are not encountered by applications studying animals that move and interact only on 2D planes. Here we introduce a system for studying the behavioral dynamics of a group of songbirds as they move throughout a 3D aviary. We study the complexities that arise when tracking a group of closely interacting animals in three dimensions and introduce a novel dataset for evaluating multi-view trackers. Finally, we analyze captured ethogram data and demonstrate that social context affects the distribution of sequential interactions between birds in the aviary.
翻译:捕捉社会群体中个人之间详细互动的能力,是我们研究动物行为和神经科学的基础。最近深层次学习和计算机视觉的进步正在推动能够同时记录多个个体的行动和互动的方法的快速进步。许多社会物种,例如鸟类,都深植于一个三维世界中。这个世界带来了更多的认知挑战,如隐蔽、取向外观外观、明显大小的巨大变化以及3D重建的传感器覆盖面差,而研究只对2D飞机移动和互动的动物进行研究的应用并没有遇到这些挑战。这里我们引入了一套系统,用于研究一组鸟类在整个3D虚拟中移动时的行为动态。我们研究了在跟踪一组密切互动的动物时产生的三个维度的复杂性,并引入了一套用于评价多视角跟踪器的新数据集。最后,我们分析了采集的方位图数据,并表明社会环境影响了鸟类在航空中相继互动的分布。