This article introduces a novel approach to the classification of categorical time series under the supervised learning paradigm. To construct meaningful features for categorical time series classification, we consider two relevant quantities: the spectral envelope and its corresponding set of optimal scalings. These quantities characterize oscillatory patterns in a categorical time series as the largest possible power at each frequency, or spectral envelope, obtained by assigning numerical values, or scalings, to categories that optimally emphasize oscillations at each frequency. Our procedure combines these two quantities to produce an interpretable and parsimonious feature-based classifier that can be used to accurately determine group membership for categorical time series. Classification consistency of the proposed method is investigated, and simulation studies are used to demonstrate accuracy in classifying categorical time series with various underlying group structures. Finally, we use the proposed method to explore key differences in oscillatory patterns of sleep stage time series for patients with different sleep disorders and accurately classify patients accordingly.
翻译:本条对在受监督的学习范式下对绝对时间序列进行分类提出了一种新的方法。为了为绝对时间序列分类建立有意义的特征,我们考虑了两个相关数量:光谱信封及其相应的最佳缩放组合。这些数量将绝对时间序列中的随机模式定性为每个频率上的最大可能功率,或者通过分配数字值或缩放而获得的频谱信封,以最佳地强调每个频率的振荡。我们的程序将这两个数量结合起来,以产生一个可解释的和有讽刺意味的基于特征的分类器,可用于准确确定绝对时间序列中的组员。对拟议方法的分类一致性进行了调查,并使用了模拟研究,以显示在将绝对时间序列与各种基本群体结构进行分类时的准确性。最后,我们使用拟议方法,探索不同睡眠障碍患者睡眠阶段时间序列中的主要差异,并据此对病人进行准确分类。