Stroke is the top leading causes of death in China (Zhou et al. The Lancet 2019). A dataset from Shanxi Province is used to identify the risk of each patient's at four states low/medium/high/attack and provide the state transition tendency through a SHAP DeepExplainer. To improve the accuracy on an imbalance sample set, the Quadratic Interactive Deep Neural Network (QIDNN) model is first proposed by flexible selecting and appending of quadratic interactive features. The experimental results showed that the QIDNN model with 7 interactive features achieve the state-of-art accuracy $83.25\%$. Blood pressure, physical inactivity, smoking, weight and total cholesterol are the top five important features. Then, for the sake of high recall on the most urgent state, attack state, the stroke occurrence prediction is taken as an auxiliary objective to benefit from multi-objective optimization. The prediction accuracy was promoted, meanwhile the recall of the attack state was improved by $24.9\%$ (to $84.83\%$) compared to QIDNN (from $67.93\%$) with same features. The prediction model and analysis tool in this paper not only gave the theoretical optimized prediction method, but also provided the attribution explanation of risk states and transition direction of each patient, which provided a favorable tool for doctors to analyze and diagnose the disease.
翻译:山西省的一个数据集用于确定每位患者在四个低度/中度/高度/高度/攻击中四个州的风险,并通过SHAP DeepExplainer提供国家过渡趋势。为了提高不平衡抽样的准确性,首先通过灵活选择和附附附四边互动特征,提出了四边互动深神经网络(QIDNN)模型(QIDNN)模型。实验结果表明,具有7个互动功能的QIDNN模型达到了最新准确度83.25美元。血液压力、身体不活动、吸烟、体重和全部胆固醇是最重要的五个重要特征。然后,为了对最紧迫的状态进行高度回顾,攻击状态,将中度预测作为辅助目标,从多目标优化中受益。预测准确性得到了推广,同时,与QIDNNN模型相比,QIDNN模型(从67.93美元增加到84.83美元)实现了最新准确度,而QIDNNN(从67.83美元)、不活跃、吸烟、体重、体重和胆胆和胆胆总胆胆)是最佳预测工具的每个分析工具,提供了最佳诊断结果的预测和最佳诊断工具。