: Publication 17348

Publication 17348

Title:	Development and Validation of a Machine Learning Model for Incident Heart Failure Prediction in Chronic Kidney Disease: A Multicenter Cohort Study.
Journal:	Journal of the American Heart Association
Published:	18 Mar 2026
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/41848060/
DOI:	https://doi.org/10.1161/jaha.125.046156
URL:	http://dx.doi.org/10.1161/jaha.125.046156

Abstract

BACKGROUND: Chronic kidney disease (CKD) and heart failure (HF) share pathophysiological mechanisms, rendering HF one of the most burdensome cardiovascular complication in CKD. Current HF prediction models, derived from the general population, exhibit limited accuracy in CKD, thus necessitating a CKD-specific risk model and clinical implementation.</p>

METHODS: The development set comprised 52 251 patients with CKD from the China Renal Data System (70% training; 30% internal validation). External validation used 21 798 patients from independent Chinese hospitals and 3323 UK Biobank participants. Outcome was 5-year new-onset HF. Five machine learning models were developed, with performance assessed via area under the curve and compared using DeLong test. The top-performing extreme gradient boosting model was simplified via forward stepwise selection; feature importance quantified using Shapley additive explanations.</p>

RESULTS: In the Chinese external validation cohort, the extreme gradient boosting model outperformed others (area under the curve, 0.879 [95% CI, 0.871-0.887]; DeLong P<0.05), with an area under the curve of 0.851 (95% CI, 0.831-0.870) in the UK Biobank cohort. The model was simplified to 9 predictors; Shapley additive explanations analysis ranked estimated glomerular filtration rate and albuminuria as among the most important features. In Chinese and UK Biobank external validation sets, the simplified extreme gradient boosting model showed a ΔC-statistic of 0.050 (0.045-0.056) and 0.006 (-0.013-0.024) versus the Atherosclerosis Risk in Communities risk score, and 0.194 (0.183-0.204) and 0.145 (0.107-0.182) versus the Predicting Risk of Cardiovascular Disease Events Equation, respectively. A web-based calculator was deployed (https://clinician.shinyapps.io/HF_Risk_Predictor_for_CKD/).</p>

CONCLUSIONS: The 9-variable extreme gradient boosting model tailored for patients with CKD may help predict HF risk in this high-risk population. Validation across diverse populations is warranted to confirm its generalizability.</p>

Abstract

10 Authors