Existing visual tracking methods usually localize a target object with a bounding box, in which the performance of the foreground object trackers or detectors is often affected by the inclusion of background clutter. To handle this problem, we learn a patch-based graph representation for visual tracking. The tracked object is modeled by with a graph by taking a set of non-overlapping image patches as nodes, in which the weight of each node indicates how likely it belongs to the foreground and edges are weighted for indicating the appearance compatibility of two neighboring nodes. This graph is dynamically learned and applied in object tracking and model updating. During the tracking process, the proposed algorithm performs three main steps in each frame. First, the graph is initialized by assigning binary weights of some image patches to indicate the object and background patches according to the predicted bounding box. Second, the graph is optimized to refine the patch weights by using a novel alternating direction method of multipliers. Third, the object feature representation is updated by imposing the weights of patches on the extracted image features. The object location is predicted by maximizing the classification score in the structured support vector machine. Extensive experiments show that the proposed tracking algorithm performs well against the state-of-the-art methods on large-scale benchmark datasets.
翻译:现有视觉跟踪方法通常以捆绑框将目标对象本地化, 使前景物体跟踪器或探测器的性能经常因包含背景杂乱而受到影响。 要处理这一问题, 我们学习了一个基于补丁的图形显示器, 用于视觉跟踪。 跟踪对象以图表为模型, 将一组非重叠的图像补丁作为节点, 每个节点的重量显示它属于前景对象的可能性, 边緣通过显示两个相邻节点的外观兼容性来进行加权。 此图是动态学习的, 并应用于对象跟踪和模型更新。 在跟踪过程中, 提议的算法在每一个框中执行三个主要步骤。 首先, 将一些图像补丁的二进制量配置成一个图形, 根据预测的捆绑框来显示对象和背景补补。 其次, 每个节点的重量表示器最优化, 通过使用新型的交替方向方法来改进补丁重量。 第三, 对象特征表示通过在提取的图像特征跟踪和模型更新时, 对对象位置进行动态学习。 在跟踪过程中, 通过在结构化的矢量分析中进行最大程度的矢量分析来预测, 显示结构矢量分析, 显示大型矢量的矢量分析方法。