Multiple sclerosis is a disease that affects the brain and spinal cord, it can lead to severe disability and has no known cure. The majority of prior work in machine learning for multiple sclerosis has been centered around using Magnetic Resonance Imaging scans or laboratory tests; these modalities are both expensive to acquire and can be unreliable. In a recent paper it was shown that disease progression can be predicted effectively using performance outcome measures (POMs) and demographic data. In our work we extend on this to focus on the modeling side, using continuous time models on POMs and demographic data to predict progression. We evaluate four continuous time models using a publicly available multiple sclerosis dataset. We find that continuous models are often able to outperform discrete time models. We also carry out an extensive ablation to discover the sources of performance gains, we find that standardizing existing features leads to a larger performance increase than interpolating missing features.
翻译:多发性硬化症是一种影响大脑和脊髓的疾病,它可能导致严重残疾,而且没有已知的治疗方法。以前为多发性硬化进行机器学习的大部分工作都是围绕使用磁共振成像扫描或实验室测试进行的;这些模式都非常昂贵,而且可能不可靠。在最近的一篇论文中显示,使用性能效果衡量和人口数据,可以有效地预测疾病的演变。我们在这方面的工作将重点扩大到建模方面,利用POMs连续时间模型和人口数据预测进展。我们用公开提供的多发性硬化成数据集评估四个连续时间模型。我们发现,连续模型往往能够超过离散时间模型。我们还进行了广泛的推算,以发现性能收益的来源。我们发现,标准化现有特征导致的性能增长大于内插缺失特征。