论文标题
MLHO的COVID-19不良结果的个性化预测
Individualized Prediction of COVID-19 Adverse outcomes with MLHO
论文作者
论文摘要
我们开发了MLHO(发音为MELO),这是一个端到端的机器学习框架,利用迭代功能和算法选择来预测健康结果。 MLHO实现了迭代顺序表示挖掘,以及特征和模型选择,以预测患者级别的住院,ICU入院,机械通气和死亡的需求。它基于患者过去病历的数据(在共同19感染之前)的数据基础。 MLHO的体系结构实现了面向结果的模型校准,其中对不同的统计学习算法和特征的向量进行了测试,以改善对健康结果的预测。使用来自13,000多名COVID-19阳性患者的大量队列中的临床和人口统计数据,我们使用了大约600个功能,对四个不良结果进行了建模,这些功能代表了患者的健康记录和人口统计学。死亡率预测的平均AUC ROC为0.91,ICU,住院和通风的预测性能在0.80至0.81之间。我们广泛地描述了用于建模中的特征群集及其用于预测每个结果的相对影响。我们的结果表明,虽然人口统计学变量(即年龄)是共同19感染后不良结果的重要预测指标,但过去的临床记录的融合对于可靠的预测模型至关重要。随着世界各地的19日大流行的发展,适应性和可解释的机器学习框架(如MLHO)对于提高我们准备面对Covid-19的潜在未来浪潮以及其他可能出现的新型感染性疾病至关重要。
We developed MLHO (pronounced as melo), an end-to-end Machine Learning framework that leverages iterative feature and algorithm selection to predict Health Outcomes. MLHO implements iterative sequential representation mining, and feature and model selection, for predicting the patient-level risk of hospitalization, ICU admission, need for mechanical ventilation, and death. It bases this prediction on data from patients' past medical records (before their COVID-19 infection). MLHO's architecture enables a parallel and outcome-oriented model calibration, in which different statistical learning algorithms and vectors of features are simultaneously tested to improve the prediction of health outcomes. Using clinical and demographic data from a large cohort of over 13,000 COVID-19-positive patients, we modeled the four adverse outcomes utilizing about 600 features representing patients' pre-COVID health records and demographics. The mean AUC ROC for mortality prediction was 0.91, while the prediction performance ranged between 0.80 and 0.81 for the ICU, hospitalization, and ventilation. We broadly describe the clusters of features that were utilized in modeling and their relative influence for predicting each outcome. Our results demonstrated that while demographic variables (namely age) are important predictors of adverse outcomes after a COVID-19 infection, the incorporation of the past clinical records are vital for a reliable prediction model. As the COVID-19 pandemic unfolds around the world, adaptable and interpretable machine learning frameworks (like MLHO) are crucial to improve our readiness for confronting the potential future waves of COVID-19, as well as other novel infectious diseases that may emerge.