An increasing number of applications require to recognize the class of an incoming time series as quickly as possible without unduly compromising the accuracy of the prediction. In this paper, we put forward a new optimization criterion which takes into account both the cost of misclassification and the cost of delaying the decision. Based on this optimization criterion, we derived a family of non-myopic algorithms which try to anticipate the expected future gain in information in balance with the cost of waiting. In one class of algorithms, unsupervised-based, the expectations use the clustering of time series, while in a second class, supervised-based, time series are grouped according to the confidence level of the classifier used to label them. Extensive experiments carried out on real data sets using a large range of delay cost functions show that the presented algorithms are able to satisfactorily solving the earliness vs. accuracy trade-off, with the supervised-based approaches faring better than the unsupervised-based ones. In addition, all these methods perform better in a wide variety of conditions than a state of the art method based on a myopic strategy which is recognized as very competitive.
翻译:越来越多的应用程序要求尽可能快地承认即将到来的时间序列的类别,同时不过分地损害预测的准确性。在本文件中,我们提出了一个新的优化标准,其中既考虑到分类错误的成本,又考虑到推迟决定的成本。根据这一优化标准,我们得出了一组非中性算法,这些算法试图预测信息在未来的预期收益与等待成本相平衡。在一类算法中,未受监督的预期使用时间序列的组合,而在第二类算法中,由监督的、时间序列则根据分类者用来标注的时间序列的可信度进行分组。使用大量延迟成本功能对真实数据集进行的广泛实验表明,所提出的算法能够令人满意地解决耳朵的灵敏度与精准交易,而以监督为基础的方法比未受监督的算法要好得多。此外,所有这些方法在多种条件下的表现都比基于我所认识到的高度竞争性的战略的艺术方法的状态要好。