: Publication 17302

Publication 17302

Title:	A foundation transformer model with self-supervised learning for ECG-based assessment of cardiac and coronary function.
Journal:	NEJM AI
Published:	26 Nov 2025
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/41446031/
DOI:	https://doi.org/10.1056/aioa2500164
URL:	https://pmc.ncbi.nlm.nih.gov/articles/PMC12724683/

Abstract

Background: The wide availability of labeled electrocardiogram (ECG) data has driven major advances in artificial intelligence (AI)-based detection of structural and functional cardiac abnormalities and thus ECG-based diagnosis. However, many critical, high value clinical diagnostic applications, such as assessing myocardial ischemia and coronary microvascular dysfunction, remain underserved due to the limited availability of labeled datasets. We developed a self-supervised ECG foundation model and demonstrate how this approach can overcome this limitation.</p>

Methods: A modified vision transformer model was pretrained using a large database of unlabeled ECG waveforms (MIMIC-IV-ECG, N=800,035). The model was then fine-tuned using smaller databases that included high-quality labels derived from positron emission tomography (N=3,126) and clinical reports (N=13,704) for 12 clinical, demographic, and traditional ECG prediction tasks. Diagnostic accuracy and model generalizability were evaluated across five additional cohorts including the publicly available PTB-XL and UK Biobank databases and labels from cardiac magnetic resonance imaging (MRI) and single photon emission computed tomography (SPECT).</p>

Results: Diagnostic performance varied across tasks with area under the receiver operating characteristic curve (AUROC) ranging from 0.763 for detection of impaired myocardial flow reserve (MFR < 2) to 0.955 for impaired left ventricular ejection fraction (LVEF < 35%). Self-supervised learning (SSL) pretraining greatly improved diagnostic accuracy in 11 of the 12 prediction tasks compared to conventional de novo supervised training. The model retained strong performance across three external and two internal cross-modality databases, with AUROC ranging from 0.771 for impaired MFR to 0.949 for impaired LVEF.</p>

Conclusion: This versatile ECG foundation model demonstrates that SSL pretraining enhances diagnostic accuracy and generalizability across diverse cardiac diagnostic applications. By enabling effective learning from limited labeled data, this approach supports AI development for complex but clinically critical tasks, such as detecting myocardial ischemia and coronary microvascular dysfunction, where high-quality labels are costly and scarce.</p>

Application ID	Title
234540	Development of AI Tools for Widely Available, Inexpensive Diagnosis of Coronary Microvascular and Vasomotor Dysfunction

Application ID

Title

234540

Development of AI Tools for Widely Available, Inexpensive Diagnosis of Coronary Microvascular and Vasomotor Dysfunction

Abstract

10 Authors

1 Application