Epilepsy is a chronic neurological disorder that affects a significant portion of the human population and imposes serious risks in the daily life of patients. Despite advances in machine learning and IoT, small, nonstigmatizing wearable devices for continuous monitoring and detection in outpatient environments are not yet available. Part of the reason is the complexity of epilepsy itself, including highly imbalanced data, multimodal nature, and very subject-specific signatures. However, another problem is the heterogeneity of methodological approaches in research, leading to slower progress, difficulty comparing results, and low reproducibility. Therefore, this article identifies a wide range of methodological decisions that must be made and reported when training and evaluating the performance of epilepsy detection systems. We characterize the influence of individual choices using a typical ensemble random-forest model and the publicly available CHB-MIT database, providing a broader picture of each decision and giving good-practice recommendations, based on our experience, where possible.
翻译:精神失常是一种慢性神经紊乱症,影响到大量人口,给病人日常生活带来严重风险。尽管在机器学习和IoT方面有所进步,但尚没有小型、非污名化的用于在门诊环境中持续监测和检测的可磨损装置,其部分原因是癫痫本身的复杂性,包括高度不平衡的数据、多式联运性质和非常特定主题的特征。然而,另一个问题是研究方法的异质性,导致进展缓慢、难以比较结果和可复制性低。因此,本条款确定了在培训和评价癫痫检测系统性能时必须作出和报告的广泛方法决定。我们用典型的随机森林混合模型和可公开查阅的CHB-MIT数据库来描述个人选择的影响,尽可能根据我们的经验,提供每项决定的更广泛情况,并提出良好做法建议。