Ship detection in aerial images remains an active yet challenging task due to arbitrary object orientation and complex background from a bird's-eye perspective. Most of the existing methods rely on angular prediction or predefined anchor boxes, making these methods highly sensitive to unstable angular regression and excessive hyper-parameter setting. To address these issues, we replace the angular-based object encoding with an anchor-and-angle-free paradigm, and propose a novel detector deploying a center and four midpoints for encoding each oriented object, namely MidNet. MidNet designs a symmetrical deformable convolution customized for enhancing the midpoints of ships, then the center and midpoints for an identical ship are adaptively matched by predicting corresponding centripetal shift and matching radius. Finally, a concise analytical geometry algorithm is proposed to refine the centers and midpoints step-wisely for building precise oriented bounding boxes. On two public ship detection datasets, HRSC2016 and FGSD2021, MidNet outperforms the state-of-the-art detectors by achieving APs of 90.52% and 86.50%. Additionally, MidNet obtains competitive results in the ship detection of DOTA.
翻译:航空图像中的船舶探测仍然是一项积极但具有挑战性的任务,因为任意的物体定向和从鸟眼角度看复杂的背景仍然是一项积极而富有挑战性的任务。大多数现有方法依靠角预测或预设的锚箱,使这些方法对不稳定的角回归和过度的超参数设置高度敏感。为了解决这些问题,我们用一个无锚和角模式来取代基于角的物体编码,并提议用一个新的探测器为每个面向编码的物体(即MidNet)部署一个中心和四个中点。MidNet设计了一个对称的、可变化的演算法,专门用来提高船舶的中点,然后对同一艘船舶的中心和中点进行适应性匹配,预测相应的中子转移和中点。最后,建议用一个简明的分析几何算算法来改进中心和中点,以便建立精确定向的捆绑框。在两个公共船舶探测数据集(HRSC2016和FGSD2021)上,MidNet将一个最先进的探测器比近于达到90.52%和86.50%的AP。此外,MidNet在船舶探测中标中获得了竞争性的结果。