Aim: Comparative assessment of the quality of predictive models of in-hospital mortality (IHM) in patients with ST-segment elevation myocardial infarction (STEMI) after percutaneous coronary artery intervention (PCI), developed on the basis of predictors in continuous, dichotomous and multilevel categorical forms.
Materials and methods: This was a single-center retrospective study analyzing data from 4677 medical records of patients with STEMI PCI who were treated at the Regional Vascular Center of Vladivostok. Two groups of patients were identified: the first consisted of 318 (6.8%) patients who died in hospital, the second — 4359 (93.2%) patients with a favorable treatment outcome. Predictive models of IHF with continuous variables were developed using multivariate logistic regression, random forest, and stochastic gradient boosting. Dichotomization of predictors was performed using grid search methods for optimal cutoff points, centroid calculation, and Shapley additive explanation (SHAP). It was proposed for multi-level categorization to use a combination of threshold values identified during dichotomization, as well as ranking cut-off thresholds using multivariate logistic regression weighting coefficients.
Results: Based on the results of a multistage analysis of indicators of the clinical and functional status of STEMI patients, new predictors of IHM were identified and validated, their categorization was performed, and prognostic models with continuous, dichotomous and multilevel categorical variables were developed (AUC: 0.885-0.902). Models whose predictors were identified using the multimetric categorization method were not inferior in accuracy to models with continuous variables and had higher quality metrics than algorithms with dichotomous predictors. The advantage of models with multilevel categorization of predictors was the ability to explain and clinically interpret the results of IHM prediction.
Conclusions: Multilevel categorization of predictors is a promising tool for explaining predictive scores in clinical medicine.
Materials and methods: This was a single-center retrospective study analyzing data from 4677 medical records of patients with STEMI PCI who were treated at the Regional Vascular Center of Vladivostok. Two groups of patients were identified: the first consisted of 318 (6.8%) patients who died in hospital, the second — 4359 (93.2%) patients with a favorable treatment outcome. Predictive models of IHF with continuous variables were developed using multivariate logistic regression, random forest, and stochastic gradient boosting. Dichotomization of predictors was performed using grid search methods for optimal cutoff points, centroid calculation, and Shapley additive explanation (SHAP). It was proposed for multi-level categorization to use a combination of threshold values identified during dichotomization, as well as ranking cut-off thresholds using multivariate logistic regression weighting coefficients.
Results: Based on the results of a multistage analysis of indicators of the clinical and functional status of STEMI patients, new predictors of IHM were identified and validated, their categorization was performed, and prognostic models with continuous, dichotomous and multilevel categorical variables were developed (AUC: 0.885-0.902). Models whose predictors were identified using the multimetric categorization method were not inferior in accuracy to models with continuous variables and had higher quality metrics than algorithms with dichotomous predictors. The advantage of models with multilevel categorization of predictors was the ability to explain and clinically interpret the results of IHM prediction.
Conclusions: Multilevel categorization of predictors is a promising tool for explaining predictive scores in clinical medicine.