Object trackers deployed on low-power devices need to be light-weight, however, most of the current state-of-the-art (SOTA) methods rely on using compute-heavy backbones built using CNNs or transformers. Large sizes of such models do not allow their deployment in low-power conditions and designing compressed variants of large tracking models is of great importance. This paper demonstrates how highly compressed light-weight object trackers can be designed using neural architectural pruning of large CNN and transformer based trackers. Further, a comparative study on architectural choices best suited to design light-weight trackers is provided. A comparison between SOTA trackers using CNNs, transformers as well as the combination of the two is presented to study their stability at various compression ratios. Finally results for extreme pruning scenarios going as low as 1% in some cases are shown to study the limits of network pruning in object tracking. This work provides deeper insights into designing highly efficient trackers from existing SOTA methods.
翻译:在低功率装置上部署的物体跟踪器需要轻量级,但是,目前大多数最新技术(SOTA)方法依靠使用CNN或变压器制造的计算重重骨架。这类模型的大小很大,无法在低功率条件下部署,而且设计大型跟踪模型的压缩变体非常重要。本文展示了如何利用大型CNN和变压器跟踪器的神经结构结构运行来设计高压缩轻量物体跟踪器。此外,还提供了关于最适合设计轻量跟踪器的建筑选择的比较研究。对使用CNN、变压器以及两种组合的SOTA跟踪器进行比较,以研究其在不同压缩比率下的稳定性。最后显示的极端运行情景结果在某些情况下低至1%,以研究物体跟踪中网络运行的极限。这项工作为从现有的SOTA方法中设计高效跟踪器提供了更深入的洞察力。