Developing a prototype for federated analysis to enhance privacy and enable trustworthy access to COVID-19 research data

  • Solmaz Eradat Oskoui
  • , Matthew Retford
  • , Eoghan Forde
  • , Rodrigo Barnes
  • , Karen J Hunter
  • , Anne Wozencraft
  • , Simon Thompson
  • , Chris Orton
  • , David Ford
  • , Sharon Heys
  • , Julie Kennedy
  • , Cynthia McNerney
  • , Jeffrey Peng
  • , Hamed Ghanbariadolat
  • , Sarah Rees
  • , Rachel H Mulholland
  • , Aziz Sheikh
  • , David Burgner
  • , Meredith Brockway
  • , Meghan B. Azad
  • Natalie Rodriguez, Helga Zoega, Sarah J Stock, Clara Calvert, Jessica E Miller, Nicole Fiorentino, Amy Racine, Jonas Haggstrom, Neil Postlethwaite

Research output: Contribution to journalArticlepeer-review

Abstract

Background: The use of federated networks can reduce the risk of disclosure for sensitive datasets by removing the requirement to physically transfer data. Federated networks support federated analytics, a type of privacy-enhancing technology, enabling trustworthy data analysis without the movement of source data. Objectives: To set out the methodology used by the International COVID-19 Data Alliance (ICODA) and its partners, the Secure Anonymised Information Linkage (SAIL) Databank and Aridhia Informatics in piloting a federated network infrastructure and consequently testing federated analytics using test data provided from an ICODA project, the International Perinatal Outcome in the Pandemic (iPOP) Study. To share the challenges and benefits of using a federated network infrastructure to enable trustworthy analysis of health-related data from multiple countries and sources. Results: This project successfully developed a federated network between the SAIL Databank and the ICODA Workbench and piloted the use of federated analysis using aggregate-level model outputs as test data from the iPOP Study, a one-year, multi-country COVID-19 research project. This integration is a first step in implementing the necessary technical, governance and user experiences for future research studies to build upon, including those using individual-level datasets from multiple data nodes. Conclusions: Creating federated networks requires extensive investment from a data governance, technology, training, resources, timing and funding perspective. For future initiatives, the establishment of a federated network should be built into medium to long term plans to provide researchers with a secure and robust data analysis platform to perform joint multi-site collaboration. Federated networks can unlock the enormous potential of national and international health datasets through enabling collaborative research that addresses critical public health challenges, whilst maintaining privacy and trustworthiness by preventing direct access to the source data.

Original languageEnglish
Article number105708
JournalInternational Journal of Medical Informatics
Volume195
DOIs
Publication statusPublished - Mar 2025

Bibliographical note

Publisher Copyright: © 2024 The Authors

Other keywords

  • COVID-19
  • Data Re-use
  • Federated Analytics
  • Federated Networks
  • Health Data Research
  • Privacy-Preserving
  • Secondary Data

Fingerprint

Dive into the research topics of 'Developing a prototype for federated analysis to enhance privacy and enable trustworthy access to COVID-19 research data'. Together they form a unique fingerprint.

Cite this