Multimodal learning for population health studies
2024/20
background
Most machine learning methodologies require large datasets with high-quality labelling to effectively train and validate the developed models. However, as medical imaging repositories used for population health studies, such as the UK Biobank, continue to grow, the conventional manual annotation approaches by experts become impractical. On the other hand, the labelling process can be automated by extracting annotations from associated metadata or reports. In such scenarios, automated labelling can introduce errors due to ambiguities in the reports, resulting in large but noisy-labelled (or even mislabelled) datasets. Consequently, this can lead to poorer generalization or replicate human biases present in the data.
The primary objective of this project is to investigate machine learning strategies that can guide optimal annotation techniques to achieve high accuracy while mitigating biases in the developed model.
RESEARCH EXPERIENCE, RESEARCH METHODS AND TRAINING
The project encompasses:
- Ddveloping skills in AI/ML techniques with application to biomedical imaging
- writing scientific reports and journal articles and presenting findings at the research conferences.
FIELD WORK, SECONDMENTS, INDUSTRY PLACEMENTS AND TRAINING
The student will have also opportunity to attend the research seminar offered at the NDPH, the BDI and the Institute of Biomedical Engineering (IBME) as the primary supervisor is a member of Imaging Hub at the IBME (https://eng.ox.ac.uk/biomedical-image-analysis/). The student will be expected to attend relevant seminars within the department and those relevant in the wider University. Subject-specific training will be received through our group's weekly supervision meetings. Students will also attend external scientific conferences where they will be expected to present the research findings.
PROSPECTIVE STUDENT
The ideal student would have:
- a degree in computer science, statistics, engineering or a related discipline
- strong programming skills (preferable Python, or Matlab/C++ and willing to learn Python)
- experience or interest in machine learning (Deep Learning) and medical image analysis
- experience or enthusiasm to work on clinically relevant problems.