Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Smoking is the most important behavioural determinant of morbidity and mortality. Using machine learning on plasma levels of 2,917 proteins in the UK Biobank (n = 43,914), we develop a proteomic Smoking Index (pSIN) comprising 51 proteins that accurately distinguish current from never smokers (AUC = 0.95; 95% CI 0.94-0.95). Validation in the China Kadoorie Biobank (n = 3,977) shows similar accuracy (AUC = 0.91; 95% CI 0.89-0.92). pSIN is significantly associated with the risk of all-cause mortality and 18 major chronic diseases, including cardiovascular, renal, pulmonary, neurodegenerative, and cancer outcomes. Among current and former smokers, pSIN predicts death and 11 diseases independently of self-reported smoking history and lifestyle factors. Genome-wide analysis identifies 125 genes (e.g., ALPP, CST5, IL12B) associated with pSIN, while exposome analysis highlights maternal smoking, diet, physical activity, and air pollution as key modifiers. Notably, pSIN tracks recovery among former smokers and identifies those whose disease risks remain comparable to current smokers. These findings demonstrate that plasma proteomics effectively capture the biological imprint of smoking and predict smoking-related morbidity and mortality, offering a more nuanced, molecularly grounded assessment of individual variation in biological response to smoking.

More information Original publication

DOI

10.1038/s41467-025-67656-x

Type

Journal article

Publication Date

2025-12-24T00:00:00+00:00

Volume

17

Keywords

Humans, Proteomics, Female, Male, Smoking, Middle Aged, Aged, Risk Factors, Adult, Machine Learning, Genome-Wide Association Study, China, United Kingdom, Biomarkers