ICU住院患者重度急性呼吸窘迫综合征早期预测模型的构建

Establishing an early prediction model for severe acute respiratory distress syndrome

  • 摘要:
      背景  急性呼吸窘迫综合征(acute respiratory distress syndrome,ARDS)发病率高,10%的ICU住院是由ARDS所致,临床特征通常在诱发事件后6 ~ 72 h出现,并迅速加重。其死亡率亦相当高且会随病情严重程度而增加。
      目的  构建一种便捷、无创的危重症ARDS早期预测模型。
      方法  采用麻省理工学院与飞利浦创建的eICU协作研究数据集,从中检索诊断为ARDS患者的呼吸频率、体温、心跳三种生命体征数据以及氧合指数(PaO2/FiO2)。PaO2/FiO2≤100 mmHg为重度ARDS。以每个氧合指数观测点(ARDS诊断时间点)为原点,96 h为一个时间窗,应用逻辑回归、随机森林及LightGBM构建预测模型,分析诊断前6 ~ 96 h、6 ~ 48 h以及6 ~ 24 h的生命体征数据预测是否会发生重度ARDS。通过oob评分、交叉验证以及校准曲线评估模型性能,并选取2014年解放军总医院医院呼吸重症监护室的ARDS病例对模型进行外部验证。
      结果  从eICU数据库检索纳入232例ARDS患者共3 140次氧合指数测量记录,其中PaO2/FiO2≤100 mmHg (1 mmHg=0.133 kPa)共1 042次。以6 ~ 96 h、6 ~ 48 h以及6 ~ 24 h的生命体征数据各自采用逻辑回归、随机森林及LightGBM建立了9个预测模型。不同时间窗比较,6 ~ 96 h的预测准确率及AUC最高;不同模型间比较随机森林模型的诊断性能最优;6 ~ 96 h随机森林模型准确率为0.833,AUC为0.885;6 ~ 48 h、6 ~ 24 h时间窗口的AUC分别为0.815、0.806; LightGBM、逻辑回归模型的6 ~ 96 h时间窗口AUC分为0.868、0.634。各模型在解放军总医院ARDS患者中进行验证,依然是6 ~ 96 h时间窗的随机森林模型预测性能最佳,准确率为0.834,AUC为0.843。
      结论  基于随机森林构建的ARDS预测模型具有良好的预测能力,通过无创且易获取的心率、体温、呼吸频率三种体征指标,利用提前6 ~ 96 h时间窗数据对重度ARDS的发生进行预测,可帮助医护人员更早地进行干预和治疗。

     

    Abstract:
      Background  Acute respiratory distress syndrome (ARDS) is a disease with high morbidity and accounts for 10% of ICU admissions, with clinical features usually presenting within 6-72 hours of the pathogenesis and rapidly worsening. The mortality rate is also high and increases with the severity of the disease.
      Objective  To establish a convenient, noninvasive early prediction model for severe ARDS.
      Methods  The eICU Collaborative Research Database created by MIT and Philips was used to retrieve data on three vital signs (respiratory rate, temperature, and heart rate) and oxygenation index (PaO2/FiO2) of patients diagnosed with ARDS, and PaO2/FiO2≤100 mmHg was considered as severe ARDS. 96 h was used as a time window, and logistic regression, random forest and LightGBM were applied to establish a prediction model to analyze vital sign data from 6-96 h, 6-48 h and 6-24 h before diagnosis to predict whether severe ARDS would occur. Model performance was evaluated by oob score, cross-validation and calibration curve, and also ARDS patients from Respiratory Intensive Care Unit of Chinese PLA General Hospital were selected to validate the models independently.
      Results  A total of 232 patients were retrieved from the eICU database with 3 140 oxygenation index measurements during hospitalization, including 1 042 with PaO2/FiO2 ≤100 mmHg. The 6-96 h, 6-48 h, and 6-24 h vital sign data were respectively used to build 9 prediction models by using logistic regression, random forest, and LightGBM. Comparing different time windows, the highest prediction accuracy and AUC were obtained for 6-96 h; the best diagnostic performance was obtained for the random forest model compared among different models; the accuracy of the random forest model for 6-96 h was 0.833 and the AUC was 0.885; the AUCs for the 6-48 h and 6-24 h time windows were 0.815 and 0.806, respectively; the AUC of LightGBM, and logistic regression models of 6-96 h time window was 0.868 and 0.634, respectively. Each model was validated in ARDS patients in Chinese PLA General Hospital, and the random forest model with 6-96 h time window had the best prediction performance with an accuracy of 0.834 and AUC of 0.843.
      Conclusion  The ARDS early prediciton model based on random forest has good predictive ability. It can warn the occurrence of severe ARDS through non-invasive and three easy-to-obtain physical indicators of heart rate, body temperature and respiratory rate, and help medical staff to make earlier intervention and treatment, relieve the pressure of inadequate medical resources, and improve the success rate of treatment.

     

/

返回文章
返回