In nature, the collective behavior of animals, such as flying birds is dominated by the interactions between individuals of the same species. However, the study of such behavior among the bird species is a complex process that humans cannot perform using conventional visual observational techniques such as focal sampling in nature. For social animals such as birds, the mechanism of group formation can help ecologists understand the relationship between social cues and their visual characteristics over time (e.g., pose and shape). But, recovering the varying pose and shapes of flying birds is a highly challenging problem. A widely-adopted solution to tackle this bottleneck is to extract the pose and shape information from 2D image to 3D correspondence. Recent advances in 3D vision have led to a number of impressive works on the 3D shape and pose estimation, each with different pros and cons. To the best of our knowledge, this work is the first attempt to provide an overview of recent advances in 3D bird reconstruction based on monocular vision, give both computer vision and biology researchers an overview of existing approaches, and compare their characteristics.
翻译:在大自然中,鸟类的集体行为,如飞鸟等动物的集体行为受同一物种个体之间相互作用的支配。然而,鸟类物种中这种行为的研究是一个复杂的过程,人类无法使用常规的视觉观察技术,例如核心的自然取样。对于鸟类等社会动物来说,群体形成机制可以帮助生态学家了解社会线索和它们随着时间的推移的视觉特征(例如,形状和形状)之间的关系。但是,恢复飞鸟的不同形状和形状是一个极具挑战性的问题。解决这种瓶颈的一个广泛采用的解决办法是提取2D图像到3D对应的外形和形成信息。 3D视觉的最近进展导致了一系列关于3D形状和显示估计的令人印象深刻的作品,每一种都有不同的利弊。根据我们的知识,这项工作是首次试图概述基于单眼视觉的3D鸟类重建的最新进展,让计算机视觉和生物学研究人员对现有方法进行概述,并比较其特征。