Feature pyramid networks have been widely adopted in the object detection literature to improve feature representations for better handling of variations in scale. In this paper, we present Feature Pyramid Grids (FPG), a deep multi-pathway feature pyramid, that represents the feature scale-space as a regular grid of parallel bottom-up pathways which are fused by multi-directional lateral connections. FPG can improve single-pathway feature pyramid networks by significantly increasing its performance at similar computation cost, highlighting importance of deep pyramid representations. In addition to its general and uniform structure, over complicated structures that have been found with neural architecture search, it also compares favorably against such approaches without relying on search. We hope that FPG with its uniform and effective nature can serve as a strong component for future work in object recognition.
翻译:物体探测文献中广泛采用地貌金字塔网络,以改善地貌表现,更好地处理规模上的差异。本文介绍地貌金字塔(PFG),这是一条深厚的多路路特征金字塔,代表地貌空间,作为由多向横向连接连接连接的平行自下而上路径的常规网格。FPG可以大幅提高单路特征金字塔网络的性能,以类似的计算成本大幅提高,突出深金字塔特征的重要性。除了其一般和统一的结构外,它除了在神经结构搜索中发现的复杂结构之外,还优于不依赖搜索的这类方法。我们希望具有统一和有效性质的FPG能够成为未来物体识别工作的有力组成部分。