林业业务中自主采伐伐木的个案划分 (Instance Segmentation for Autonomous Log Grasping in Forestry Operations)

Wood logs picking is a challenging task to automate. Indeed, logs usually come in cluttered configurations, randomly orientated and overlapping. Recent work on log picking automation usually assume that the logs' pose is known, with little consideration given to the actual perception problem. In this paper, we squarely address the latter, using a data-driven approach. First, we introduce a novel dataset, named TimberSeg 1.0, that is densely annotated, i.e., that includes both bounding boxes and pixel-level mask annotations for logs. This dataset comprises 220 images with 2500 individually segmented logs. Using our dataset, we then compare three neural network architectures on the task of individual logs detection and segmentation; two region-based methods and one attention-based method. Unsurprisingly, our results show that axis-aligned proposals, failing to take into account the directional nature of logs, underperform with 19.03 mAP. A rotation-aware proposal method significantly improve results to 31.83 mAP. More interestingly, a Transformer-based approach, without any inductive bias on rotations, outperformed the two others, achieving a mAP of 57.53 on our dataset. Our use case demonstrates the limitations of region-based approaches for cluttered, elongated objects. It also highlights the potential of attention-based methods on this specific task, as they work directly at the pixel-level. These encouraging results indicate that such a perception system could be used to assist the operators on the short-term, or to fully automate log picking operations in the future.

翻译：木材日志的采集是自动化的艰巨任务。事实上, 日志通常以杂乱的配置形式出现, 随机调整和重叠。最近关于记录采集自动化的工作通常假设日志的外形已经为人所知, 很少考虑到实际的感知问题。在本文中, 我们用数据驱动的方法直截了当地处理后一种问题。首先, 我们引入了一个新的数据集, 名为 TimberSeg 1.0, 这个数据集, 既包括捆绑框, 也包括对日志的像素级遮罩说明。这个数据集由220 个图像组成, 配有 2500 个单项的日志。更有趣的是, 我们用我们的数据集来比较三个神经网络结构结构结构结构结构结构结构, 而不是直接显示我们系统对具体日志的偏差。我们的结果显示, 轴校正一致的建议, 无法考虑到日志的定向性质, 这些基于19.03 mAP 。旋转认知建议的方法可以大大改进结果到 31. 83 mperAP 。更有意思的是, 以转换为基于 eal- e- eal- bal- leglegleglegal 方法, 在单个操作上, 在不直接使用任何方向上, 方向上, 在任何特定的操作中, 在任何方向上, 方向上, 方向上, 方向上, 方向上, 在任何方向偏差偏差的操作中, 方向上, 方向上, 显示我们对等操作显示我们方向方向选择方向方向方向方向方向方向方向方向方向方向方向方向方向方向方向。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日