If you build it, they will come: unintended future uses of organised health data collections.
O'Doherty KC., Christofides E., Yen J., Bentzen HB., Burke W., Hallowell N., Koenig BA., Willison DJ.
BACKGROUND: Health research increasingly relies on organized collections of health data and biological samples. There are many types of sample and data collections that are used for health research, though these are collected for many purposes, not all of which are health-related. These collections exist under different jurisdictional and regulatory arrangements and include: 1) Population biobanks, cohort studies, and genome databases 2) Clinical and public health data 3) Direct-to-consumer genetic testing 4) Social media 5) Fitness trackers, health apps, and biometric data sensors Ethical, legal, and social challenges of such collections are well recognized, but there has been limited attention to the broader societal implications of the existence of these collections. DISCUSSION: Although health research conducted using these collections is broadly recognized as beneficent, secondary uses of these data and samples may be controversial. We examine both documented and hypothetical scenarios of secondary uses of health data and samples. In particular, we focus on the use of health data for purposes of: Forensic investigations Civil lawsuits Identification of victims of mass casualty events Denial of entry for border security and immigration Making health resource rationing decisions Facilitating human rights abuses in autocratic regimes CONCLUSIONS: Current safeguards relating to the use of health data and samples include research ethics oversight and privacy laws. These safeguards have a strong focus on informed consent and anonymization, which are aimed at the protection of the individual research subject. They are not intended to address broader societal implications of health data and sample collections. As such, existing arrangements are insufficient to protect against subversion of health databases for non-sanctioned secondary uses, or to provide guidance for reasonable but controversial secondary uses. We are concerned that existing debate in the scholarly literature and beyond has not sufficiently recognized the secondary data uses we outline in this paper. Our main purpose, therefore, is to raise awareness of the potential for unforeseen and unintended consequences, in particular negative consequences, of the increased availability and development of health data collections for research, by providing a comprehensive review of documented and hypothetical non-health research uses of such data.