TY - JOUR
T1 - Developing a prototype for federated analysis to enhance privacy and enable trustworthy access to COVID-19 research data
AU - Eradat Oskoui, Solmaz
AU - Retford, Matthew
AU - Forde, Eoghan
AU - Barnes, Rodrigo
AU - J Hunter, Karen
AU - Wozencraft, Anne
AU - Thompson, Simon
AU - Orton, Chris
AU - Ford, David
AU - Heys, Sharon
AU - Kennedy, Julie
AU - McNerney, Cynthia
AU - Peng, Jeffrey
AU - Ghanbariadolat, Hamed
AU - Rees, Sarah
AU - H Mulholland, Rachel
AU - Sheikh, Aziz
AU - Burgner, David
AU - Brockway, Meredith
AU - B. Azad, Meghan
AU - Rodriguez, Natalie
AU - Zoega, Helga
AU - J Stock, Sarah
AU - Calvert, Clara
AU - E Miller, Jessica
AU - Fiorentino, Nicole
AU - Racine, Amy
AU - Haggstrom, Jonas
AU - Postlethwaite, Neil
N1 - Publisher Copyright: © 2024 The Authors
PY - 2025/3
Y1 - 2025/3
N2 - Background: The use of federated networks can reduce the risk of disclosure for sensitive datasets by removing the requirement to physically transfer data. Federated networks support federated analytics, a type of privacy-enhancing technology, enabling trustworthy data analysis without the movement of source data. Objectives: To set out the methodology used by the International COVID-19 Data Alliance (ICODA) and its partners, the Secure Anonymised Information Linkage (SAIL) Databank and Aridhia Informatics in piloting a federated network infrastructure and consequently testing federated analytics using test data provided from an ICODA project, the International Perinatal Outcome in the Pandemic (iPOP) Study. To share the challenges and benefits of using a federated network infrastructure to enable trustworthy analysis of health-related data from multiple countries and sources. Results: This project successfully developed a federated network between the SAIL Databank and the ICODA Workbench and piloted the use of federated analysis using aggregate-level model outputs as test data from the iPOP Study, a one-year, multi-country COVID-19 research project. This integration is a first step in implementing the necessary technical, governance and user experiences for future research studies to build upon, including those using individual-level datasets from multiple data nodes. Conclusions: Creating federated networks requires extensive investment from a data governance, technology, training, resources, timing and funding perspective. For future initiatives, the establishment of a federated network should be built into medium to long term plans to provide researchers with a secure and robust data analysis platform to perform joint multi-site collaboration. Federated networks can unlock the enormous potential of national and international health datasets through enabling collaborative research that addresses critical public health challenges, whilst maintaining privacy and trustworthiness by preventing direct access to the source data.
AB - Background: The use of federated networks can reduce the risk of disclosure for sensitive datasets by removing the requirement to physically transfer data. Federated networks support federated analytics, a type of privacy-enhancing technology, enabling trustworthy data analysis without the movement of source data. Objectives: To set out the methodology used by the International COVID-19 Data Alliance (ICODA) and its partners, the Secure Anonymised Information Linkage (SAIL) Databank and Aridhia Informatics in piloting a federated network infrastructure and consequently testing federated analytics using test data provided from an ICODA project, the International Perinatal Outcome in the Pandemic (iPOP) Study. To share the challenges and benefits of using a federated network infrastructure to enable trustworthy analysis of health-related data from multiple countries and sources. Results: This project successfully developed a federated network between the SAIL Databank and the ICODA Workbench and piloted the use of federated analysis using aggregate-level model outputs as test data from the iPOP Study, a one-year, multi-country COVID-19 research project. This integration is a first step in implementing the necessary technical, governance and user experiences for future research studies to build upon, including those using individual-level datasets from multiple data nodes. Conclusions: Creating federated networks requires extensive investment from a data governance, technology, training, resources, timing and funding perspective. For future initiatives, the establishment of a federated network should be built into medium to long term plans to provide researchers with a secure and robust data analysis platform to perform joint multi-site collaboration. Federated networks can unlock the enormous potential of national and international health datasets through enabling collaborative research that addresses critical public health challenges, whilst maintaining privacy and trustworthiness by preventing direct access to the source data.
KW - COVID-19
KW - Data Re-use
KW - Federated Analytics
KW - Federated Networks
KW - Health Data Research
KW - Privacy-Preserving
KW - Secondary Data
UR - https://www.scopus.com/pages/publications/85211079272
U2 - 10.1016/j.ijmedinf.2024.105708
DO - 10.1016/j.ijmedinf.2024.105708
M3 - Article
SN - 1386-5056
VL - 195
JO - International Journal of Medical Informatics
JF - International Journal of Medical Informatics
M1 - 105708
ER -