We propose new ensemble models for multivariate functional data classification as combinations of semi-metric-based weak learners. Our models extend current semi-metric-type methods from the univariate to the multivariate case, propose new semi-metrics to compute distances between functions, and consider more flexible options for combining weak learners using stacked generalisation methods. We apply these ensemble models to identify respondents' difficulty with survey questions, with the aim to improve survey data quality. As predictors of difficulty, we use mouse movement trajectories from the respondents' interaction with a web survey, in which several questions were manipulated to create two scenarios with different levels of difficulty.
翻译:我们提出了新的多变量功能数据分类混合模型,作为基于半计量的薄弱学习者的组合。我们的模型将目前的半计量类型方法从单词型扩展至多变量型案例,提出新的半计量法以计算功能之间的距离,并考虑采用堆叠的简单化方法将弱学习者合并在一起的更灵活选项。我们运用这些组合模型来查明被调查者在调查问题上的困难,目的是提高调查数据的质量。作为困难预测者,我们使用来自被调查者与网络调查互动的鼠标移动轨迹,其中几个问题被操纵,制造两种困难程度不同的情景。