基于集成机器学习的ICU老年多器官功能不全早期死亡风险预测模型

Early death risk prediction model for multiple organ dysfunction in the elderly in ICU based on integrated machine learning

  • 摘要:
      目的  基于集成机器学习模型XGBoost(Extreme Gradient Boosting)构建ICU住院老年多器官功能不全综合征(multiple organ dysfunction syndrome in the elderly,MODSE)早期(入ICU 24 h后)死亡预测模型,以更好地辅助临床决策和治疗。
      方法  利用公开的基于电子病历的大型数据库中重症医学信息数据库MIMIC-Ⅲ(Medical Information Mart for Intensive Care),纳入MODSE患者14 329例,其中院内死亡2 341例(16.3%),随机抽取80%作为训练集,剩余20%为测试集,根据预后将患者分为死亡组(1 864例)和存活组(9 599例),采集人口统计学信息、入ICU第一天的生命体征、临床干预措施、全身炎症反应综合征评分(SIRS),序贯器官衰竭估计评分(SOFA)作为模型参数,比较两组患者各指标差异;采用XGBoost模型算法进行模型训练,研究死亡相关特征重要性排名分布;用受试者工作特征曲线(ROC)评估模型对MODSE患者死亡风险的预测价值。
      结果  与存活组比较,死亡组患者格拉斯哥昏迷指数(glasgow coma scale,GCS)、年龄、心率最大值等较高,体质量指数(body mass index,BMI)、收缩压最小值等较低,差异均存在统计学意义(P均<0.01)。XGBoost构建预测模型的特征排名前10的指标为呼吸频率、活化的部分凝血酶原时间(activated part of the prothrombin time,APTT)、年龄、体温、BMI、收缩压、血小板、血糖、休克指数、白细胞计数;Xgboost模型预测MODSE患者死亡的AUC为0.853,敏感性为0.824,特异性为0.725,准确率为0.854,高于SOFA等各类评分。
      结论  与传统的评分相比,XGBoost模型的预测性能更加优越,可以更好地辅助临床决策,更早地指导临床医生开展集束化治疗。

     

    Abstract:
      Objective  To construct a predictive model for early death of multiple organ dysfunction syndrome in the elderly (MODSE) based on integrated learning model XGBoost, so as to provide evidence for clinical decision-making and treatment.
      Methods  Using the openly available electronic medical record-based database Medical Information Mart for Intensive Care (MIMIC)-Ⅲ, 14 329 patients with MODSE were included, of which 2 341 (16.3%) died in the hospital. Eighty percent of the sample were randomly selected as the training set, and the remaining 20% were the testing set. According to the prognosis, the patients were divided into death group (n=1 864) and survival group (n=9 599). Demographic characteristics, vital signs on the first day admitted to ICU, clinical intervention measures as model parameters were collected, and differences between the two groups were compared. XGBoost model algorithm was used to carry out model training and investigation of the distribution and ranking of death-related features, and the predictive value of the model for the risk of death in patients with MODSE was also evaluated using receiver operating characteristic (ROC) curve.
      Results  Compared with the survival group, patients in the death group had significantly higher mean Glasgow coma score (GCS), mean age, maximum heart rate, but lower BMI and systolic blood pressure (all P < 0.01). The ten most important variables to predict death risk in the XGBoost model were respiratory rate, APTT, age, body temperature, BMI, systolic blood pressure, platelet, blood glucose, shock index, and white blood cell count. In the prediction of death in MODSE patients, the XGBoost model had a sensitivity of 0.824, specificity of 0.725, and accuracy of 0.854 with AUC of 0.853.
      Conclusion  XGBoost model is a better predictive model with high accuracy, which can provide assistance in clinical decision-making and treatment of MODSE and thus improving the prognosis of elderly patients in the ICU.

     

/

返回文章
返回