Determining the most informative features for predicting the overall survival of patients diagnosed with high-grade gastroenteropancreatic neuroendocrine neoplasms is crucial for improving individual treatment plans and for advancing the biological understanding of the disease. Recently developed ensemble feature selectors such as the Repeated Elastic Net Technique for Feature Selection (RENT) and the User-Guided Bayesian Framework for Feature Selection (UBayFS) allow the user to identify such features in datasets with low sample sizes. While RENT is purely data-driven, UBayFS can integrate expert knowledge a priori into the feature selection process. In this work, we compare both feature selectors on a dataset comprising 63 patients and 134 features from multiple sources, including basic patient characteristics, baseline blood values, tumor histology, imaging, and treatment information. Our experiments involve data-driven and expert-driven setups, as well as combinations of both. We use findings from clinical literature as a source of expert knowledge. Our results demonstrate that both feature selectors allow accurate predictions, and that expert knowledge has a stabilizing effect on the selected feature set, while its impact on predictive performance is limited. The features WHO Performance Status, Albumin, Platelets, Ki-67, Tumor Morphology, Total MTV, Total TLG, and SUVmax are the most stable and predictive features in our study.