As data volumes continue to grow, searches in data are becoming increasingly time-consuming. Classical index structures for neighbor search are no longer sustainable due to the "curse of dimensionality". Instead, approximated index structures offer a good opportunity to significantly accelerate the neighbor search for clustering and outlier detection and to have the lowest possible error rate in the results of the algorithms. Local sensing hashes is one of those. We indicate directions to mathematically model the properties of it.
翻译:随着数据量的继续增长,数据搜索正变得越来越耗时。 用于邻居搜索的古典索引结构由于“维度诅咒”而不再可持续。 相反,近似索引结构提供了一个很好的机会,可以大大加快邻居群集搜索和外部探测的步伐,并在算法结果中达到尽可能最低的误差率。本地传感是其中之一。我们指出数学模型属性的方向。