It is expensive and time-consuming to collect sufficient labeled data to build human activity recognition (HAR) models. Training on existing data often makes the model biased towards the distribution of the training data, thus the model might perform terribly on test data with different distributions. Although existing efforts on transfer learning and domain adaptation try to solve the above problem, they still need access to unlabeled data on the target domain, which may not be possible in real scenarios. Few works pay attention to training a model that can generalize well to unseen target domains for HAR. In this paper, we propose a novel method called Semantic-Discriminative Mixup (SDMix) for generalizable cross-domain HAR. Firstly, we introduce semantic-aware Mixup that considers the activity semantic ranges to overcome the semantic inconsistency brought by domain differences. Secondly, we introduce the large margin loss to enhance the discrimination of Mixup to prevent misclassification brought by noisy virtual labels. Comprehensive generalization experiments on five public datasets demonstrate that our SDMix substantially outperforms the state-of-the-art approaches with 6% average accuracy improvement on cross-person, cross-dataset, and cross-position HAR.
翻译:收集足够的标签数据以建立人类活动识别(HAR)模型是昂贵和耗时的。关于现有数据的培训往往使模型偏向于分配培训数据的模式,因此该模型可能在不同分布的测试数据上表现极差。虽然目前在转移学习和领域适应方面的努力试图解决上述问题,但它们仍需要获取目标域的未标签数据,这在真实情况下可能是不可能的。很少有人会注意培训一种能够向HAR广泛推广到看不见的目标域的模式。在本文中,我们提出了一种名为通用跨多功能混合(SDMix)的新颖方法。首先,我们引入了考虑到活动语义变异范围以克服语义差异带来的语义变异的语义变混集。第二,我们引入了巨大的差值损失,以强化混集的区别,防止由噪音虚拟标签带来的分类错误。在五个公共数据集上的全面概括化实验表明,我们的SDMix大大地超越了可通用跨面的跨数据结构。我们采用了6 %平均的跨数据的跨位、跨位化。