Metric space magnitude, an active field of research in algebraic topology, is a scalar quantity that summarizes the effective number of distinct points that live in a general metric space. The {\em weighting vector} is a closely-related concept that captures, in a nontrivial way, much of the underlying geometry of the original metric space. Recent work has demonstrated that when the metric space is Euclidean, the weighting vector serves as an effective tool for boundary detection. We recast this result and show the weighting vector may be viewed as a solution to a kernelized SVM. As one consequence, we apply this new insight to the task of outlier detection, and we demonstrate performance that is competitive or exceeds performance of state-of-the-art techniques on benchmark data sets. Under mild assumptions, we show the weighting vector, which has computational cost of matrix inversion, can be efficiently approximated in linear time. We show how nearest neighbor methods can approximate solutions to the minimization problems defined by SVMs.
 翻译:矩阵空间规模是代数层表层学中一个活跃的研究领域,是一个标度数量,它总结了存在于一般计量空间中的不同点的有效数量。 ~ em加权矢量是一个密切相关的概念, 以非三重方式捕捉原始计量空间的大部分基本几何学。 最近的工作表明, 当测量空间为欧几里特时, 加权矢量是进行边界探测的有效工具。 我们重写这一结果并显示加权矢量可被视为内核 SVM 的解决方案。 其中一个结果是, 我们用这种新的洞察方法来进行外核探测, 我们展示在基准数据集上具有竞争力或超过最新技术性能的性能。 在轻微的假设下, 我们显示加权矢量的矢量在线性时间里可以有效地接近计算矩阵转换成本。 我们显示, 最接近的邻居方法可以如何近似地解决由 SVMs 定义的最小化问题。