Predicting left ventricular hypertrophy from the 12-lead electrocardiogram in the UK Biobank imaging study using machine learning.
Naderi H., Ramírez J., van Duijvenboden S., Pujadas ER., Aung N., Wang L., Anwar Ahmed Chahal C., Lekadir K., Petersen SE., Munroe PB.
AIMS: Left ventricular hypertrophy (LVH) is an established, independent predictor of cardiovascular disease. Indices derived from the electrocardiogram (ECG) have been used to infer the presence of LVH with limited sensitivity. This study aimed to classify LVH defined by cardiovascular magnetic resonance (CMR) imaging using the 12-lead ECG for cost-effective patient stratification. METHODS AND RESULTS: We extracted ECG biomarkers with a known physiological association with LVH from the 12-lead ECG of 37 534 participants in the UK Biobank imaging study. Classification models integrating ECG biomarkers and clinical variables were built using logistic regression, support vector machine (SVM) and random forest (RF). The dataset was split into 80% training and 20% test sets for performance evaluation. Ten-fold cross validation was applied with further validation testing performed by separating data based on UK Biobank imaging centres. QRS amplitude and blood pressure (P < 0.001) were the features most strongly associated with LVH. Classification with logistic regression had an accuracy of 81% [sensitivity 70%, specificity 81%, Area under the receiver operator curve (AUC) 0.86], SVM 81% accuracy (sensitivity 72%, specificity 81%, AUC 0.85) and RF 72% accuracy (sensitivity 74%, specificity 72%, AUC 0.83). ECG biomarkers enhanced model performance of all classifiers, compared to using clinical variables alone. Validation testing by UK Biobank imaging centres demonstrated robustness of our models. CONCLUSION: A combination of ECG biomarkers and clinical variables were able to predict LVH defined by CMR. Our findings provide support for the ECG as an inexpensive screening tool to risk stratify patients with LVH as a prelude to advanced imaging.