Benchmarking federated learning approaches against siloed and mega-analysis regimes


Although neuroimaging is seeing a growing number of datasets, the international adoption of strong data privacy frameworks (Marelli & Testa, 2018) has led to many of these datasets remaining in so-called “silos”. When data cannot readily be shared, it becomes imperative to develop distributed data processing tools and federated analysis methods to enable large-scale multi-site studies.

In this project, we compare a simple federated analysis setup (i.e. sharing only fitted model parameters) with two traditional experimental setups:

We evaluate the performance of machine learning (ML) models on several neuroimaging datasets of Parkinson’s (PD) and Alzheimer’s disease (AD) on two common prediction tasks in neurodegenerative diseases: 1) brain age and 2) cognitive decline. We hypothesize that model performance improves as we go from siloed to federated to mega-analysis setups.

Machine learning setups

People

Michelle Wang
PhD student
Nikhil Bhagwat
Academic Associate