基于机器学习的创伤性休克患者院内死亡预测模型研究

In-hospital mortality prediction model for patients with traumatic shock based on machine learning

  • 摘要:
      背景  创伤性休克(traumatic shock,TS)是导致创伤患者死亡的重要原因,关注TS患者的预后具有重要意义。
      目的  构建基于入院生命体征、入院血气分析等早期指标的机器学习模型,用以预测TS患者临床结局,辅助医务人员进行医疗决策。
      方法  收集解放军总医院第一医学中心急诊科2014年6月 - 2021年12月收治的TS患者信息,分为建模队列和外部验证队列两个部分,基于决策树(decision tree,DT)、逻辑回归(logistic regression,LR)、随机森林(random forest,RF)三种分类算法建立院内死亡预测模型,比较不同模型的预测效能并选择最佳模型进行外部验证。
      结果  建模过程中,筛选出TS患者281例,根据患者本次出院时是否存活,分为生存组(218例)和死亡组(63例)。内部验证中,RF模型获得的受试者工作特征曲线下面积(area under receiver operator characteristic curve,AUC)最高,为0.856(95% CI:0.847 ~ 0.865),DT模型AUC为0.756(95% CI:0.740 ~ 0.772),LR模型AUC为0.801(95% CI:0.780 ~ 0.822)。RF模型的准确率(0.807)、精确率(0.886)、召回率(0.866)、F值(0.876)、AUC(0.856)等分类器性能指标均值均大于0.8。在50例外部验证集中RF模型的表现与内部验证相似,提示RF模型对TS患者死亡风险识别能力较强。通过校准曲线和决策曲线评价三种模型的校准度和临床实用性,结果提示RF模型亦优于LR和DT模型。
      结论   机器学习预测模型在预测TS患者院内死亡方面具有较好的应用前景。

     

    Abstract:
      Background  Traumatic shock (TS) is an important cause of death in trauma patients. It is of great significance of focusing on the prognosis of TS patients.
      Objective  To construct machine learning models based on early indicators including vital signs and arterial blood gas analysis on admission to predict clinical outcome in TS patients for medical decision-making.
      Methods  Clinical data about TS patients admitted to the Emergency Department of the First Medical Center of Chinese PLA General Hospital from Jun. 2014 to Dec. 2021 were retrospectively collected. In-hospital mortality prediction models were constructed based on three classification algorithms: decision tree (DT), logistic regression (LR), and random forest (RF). The prediction performance of different models was compared and the best model was selected for external validation.
      Results  In the modeling process, 281 TS patients who met the inclusion and exclusion criteria were screened, and divided into survival group (n=218) and death group (n=63) according to whether the patients were alive at discharge. In the internal validation set, the RF model obtained the highest area under receiver operator characteristic curve (AUC), which was 0.856 (95% CI: 0.847-0.865). The AUC of DT model was 0.756 (95% CI: 0.740-0.772). The AUC of LR model was 0.801 (95% CI: 0.780-0.822). The mean values of classifier performance metrics of the RF model including accuracy (0.807), precision (0.886), recall (0.866), F-value (0.876), and AUC (0.856) were all greater than 0.8. And the RF model had similar performance in the external validation set, suggesting that the RF model had a strong ability to identify the risk of death in TS patients. The calibration and clinical usefulness of the three models were evaluated by calibration curves and decision curves, suggesting that the RF model also outperformed the LR and DT models.
      Conclusion  Prediction models based on machine learning have a promising application in predicting in-hospital mortality of TS patients.

     

/

返回文章
返回