Economic policy and research rely on the correct evaluation of the billions of high-frequency data points that we collect every day. Consistent clustering algorithms, like DBSCAN, allow us to make sense of the data in a useful way. However, while there is a large literature on the consistency of various clustering algorithms for high-dimensional static clustering, the literature on multivariate time series clustering still largely relies on heuristics or restrictive assumptions. The aim of this paper is to prove a notion of consistency of DBSCAN for the task of clustering multivariate time series.
翻译:暂无翻译