UK Biobank: from downloading data to extracting variables
Download Lei Clifton's step-by-step guide on the first steps of handling UK Biobank data.
BSc, MSc, PhD
- Official Fellow in AI & Machine Learning, Reuben College
Statistical and machine learning expertise in epidemiology and clinical studies.
I joined the Nuffield Department of Population Health (NDPH) in 2019 as the team leader of the Translational Epidemiology Unit (TEU), under its Director Professor David Hunter. I lead a programme of research in translational cancer epidemiology. Key research includes assessing the performance of large-scale information on lifestyle and environment, and assisting in the development of exposure assessment instruments suitable for use at scale.
I manage specialist grant funded research projects, including the recruitment, supervision and operational management of a research group. I line manage other members of the team, contributing to their development through induction, appraisal, and coaching.
From 2014 to 2018, I worked for Prof Doug Altman in the Centre for Statistics in Medicine (CSM), where I led statistical work on clinical trials, observational studies, and research on trial methodology. I was a senior advisor on the NIHR Research Design Service (RDS) team, which provides free advice on research design to researchers in the South Central region. I collaborated extensively with principal investigators in trial design and grant applications.
During my 5 years in CSM, I also provided statistical supervision in fellowship applications, taught statistics at the postgraduate level, and reviewed grant proposals for the NIHR. I was a Scientific Research Committee (SRC) member of the Northern Ireland Chest Heart and Stroke (NICHS), responsible for reviewing proposals and allocating research grants.
From 2009 to 2014, I worked in the Institute of Biomedical Engineering at the University of Oxford, where I undertook research into statistical time-series models for providing early warning of deterioration in post-surgery patients. From 2007 to 2009, I was a post-doctoral researcher in the Nuffield Department of Clinical Neurosciences, University of Oxford, where I developed mathematical models and prototype apparatus for measuring the lung function of ICU patients.
I was awarded a PhD in Statistical Machine Learning in 2007 from UMIST (now the University of Manchester), after completing my BSc and MSc degrees in Electrical Engineering at the Beijing Institute of Technology, China. I joined CSM in 2014.
Joining CSM and then NDPH has been one of the best choices I have made in my life, as I thoroughly enjoy working with the team on epidemiology and machine learning. When I am at home, you can find me painting watercolours, practising yoga, and making noise on the violin.
Assessing agreement between different polygenic risk scores in the UK Biobank.
Clifton L. et al, (2022), Sci Rep, 12
An independent external validation of the QRISK3 cardiovascular risk prediction model applied to UK Biobank participants
Parsons RE. et al, (2022)
Combining Machine Learning with Cox models for identifying risk factors for incident post-menopausal breast cancer in the UK Biobank
Liu X. et al, (2022)
Hypertension, a dementia polygenic risk score, APOE genotype, and incident dementia.
Littlejohns TJ. et al, (2022), Alzheimers Dement
Calculating Polygenic Risk Scores (PRS) in UK Biobank: A Practical Guide for Epidemiologists.
Collister JA. et al, (2022), Front Genet, 13