A Parallel Federated Learning Framework

This repository is created to experiment easily in parallel for the state of the art methods in Federated Learning. It also includes the implementation of How to Combine Variational Bayesian Networks in Federated Learning in Pytorch. Please cite the paper if you benefit from this framework. For further details, contact buldu19@itu.edu.tr or ozera17@itu.edu.tr.

Supported Algorithms

SCAFFOLD
FedProx
FedAvg
FedNova
BFLAvg (Ours)

Requirements and Usage

Requirements

You can install the requirements for this project by using requirements.txt

$ conda install --file requirements.txt

Data

Data preperation

The dataset is automatically downloaded and prepared by the code when first time running the experiment. For precaching the dataset, you can run the following command:

$\textbf{Datasets}$	$\textbf{Image Size}$	$\textbf{Number of Labels}$	$\textbf{Train Size}$	$\textbf{Test Size}$
FMNIST	$1 \times 28 \times 28$	$10$	$60000$	$10000$
Cifar-10	$3 \times 32 \times 32$	$10$	$50000$	$10000$
SVHN	$3 \times 32 \times 32$	$10$	$73257$	$26032$

 $ python utils/data/data_downloader.py

Non-IID Data Generation

We inherited the non-IID data generation methods from Federated Learning on Non-IID Data Silos: An Experimental Study. You can run the following experiments:

Experiment	Description
IID	IID data generation for 10 clients
IID-500	IID data generation for 100 clients
noniid-labeldir	Non-IID data generation for 10 clients with dirichlet distribution
noniid-labeldir-500	Non-IID data generation for 100 clients with dirichlet distribution
noniid-label[1:4]	Non-IID data generation for 10 clients with selection of how many class each client have (choices: 1, 2, 3, 4)
iid-diff-quantity	IID data generation for 10 clients with different quantity of data
iid-diff-quantity-500	IID data generation for 100 clients with different quantity of data

Usage

You can run the experiments by using the following command:

 python train.py \
    --dataset=cifar10 \
    --alg=BFLAVG \
    --experiment=noniid-labeldir \
    --device='cuda:0'\
    --process=5 \
    --datadir='./data/' \
    --logdir='./logs/' \
    --init_seed=0

Parameter	Description
dataset	Dataset name: cifar10, fmnist, kmnist, cifar100, svhn, covertype
alg	Algorithm name: BFL, BFLAVG, Fed, FedAVG, FedProx, FedNova, Scaffold
experiment	Experiment name: noniid-labeldir[-500], iid[-500], noniid-label[1:4], iid-dif-quantity[-500]
device	Device name: cuda:0, cpu
process	Number of processes for multiprocessing
datadir	Data directory path
logdir	Log directory path
init_seed	Initial seed number for the experiment
desc	Description of the experiment

Example runtime comparison of multi-process pipeline

Runtime comparison results based on the number of processes of IID partitioned 100 clients experiment with means $\pm$ standard errors of Time Per Communication round (TPC) across five communication rounds for CIFAR-10 dataset. Multi-processed pipeline with 10 processes is the fastest for all models.

$\textbf{\Large Time per Communication Round}$

$\textbf{Model}$	$\textbf{Agg.}$	$\textbf{1 process}$	$\textbf{5 processes}$	$\textbf{10 processes}$
FED	$\texttt{N/A}$	$60.83$ $\pm 0.26$	$15.55$ $\pm 0.51$	$9.30$ $\pm 0.07$
FEDAVG	$\texttt{N/A}$	$60.90$ $\pm 0.18$	$15.57$ $\pm 0.39$	$9.22$ $\pm 0.20$
FVBA	$\texttt{EAA}$	$72.77$ $\pm 0.18$	$16.22$ $\pm 0.05$	$9.49$ $\pm 0.06$
	$\texttt{GAA}$	$71.23$ $\pm 0.88$	$16.48$ $\pm 0.10$	$9.41$ $\pm 0.05$
	$\texttt{AALV}$	$72.10$ $\pm 0.36$	$16.33$ $\pm 0.10$	$9.51$ $\pm 0.09$
	$\texttt{PPA}$	$66.95$ $\pm 0.31$	$18.06$ $\pm 0.20$	$11.23$ $\pm 0.16$
	$\texttt{CF}$	$72.53$ $\pm 0.29$	$16.34$ $\pm 0.10$	$9.36$ $\pm 0.14$
FVBWA	$\texttt{EAA}$	$72.38$ $\pm 0.31$	$16.45$ $\pm 0.06$	$9.44$ $\pm 0.11$
	$\texttt{GAA}$	$72.78$ $\pm 0.15$	$15.88$ $\pm 0.25$	$9.42$ $\pm 0.11$
	$\texttt{AALV}$	$72.41$ $\pm 0.24$	$16.19$ $\pm 0.09$	$9.64$ $\pm 0.13$
	$\texttt{PPA}$	$67.51$ $\pm 0.16$	$17.99$ $\pm 0.42$	$11.15$ $\pm 0.12$
	$\texttt{CF}$	$72.86$ $\pm 0.40$	$17.22$ $\pm 0.40$	$10.56$ $\pm 0.08$

Citation

@article{ozer2022combine,
  title={How to Combine Variational Bayesian Networks in Federated Learning},
  author={Ozer, Atahan and Buldu, Kadir Burak and Akg{\"u}l, Abdullah and Unal, Gozde},
  journal={arXiv preprint arXiv:2206.10897},
  year={2022}
}

Contributors

Name	Email	Github
Kadir Burak Buldu	buldu19@itu.edu.tr	buldubu
Atahan Özer	ozera17@itu.edu.tr	TrubadurOsman
Abdullah Akgül	akgula15@itu.edu.tr	aportekila
Gozde Unal	gozde.unal@itu.edu.tr	gozde-unal

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
figures		figures
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
algorithms.py		algorithms.py
architectures.py		architectures.py
parser.py		parser.py
requirements.txt		requirements.txt
scores.py		scores.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

A Parallel Federated Learning Framework

Supported Algorithms

Requirements and Usage

Requirements

Data

Data preperation

Non-IID Data Generation

Usage

Example runtime comparison of multi-process pipeline

Citation

Contributors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

ituvisionlab/BFL-P

Folders and files

Latest commit

History

Repository files navigation

A Parallel Federated Learning Framework

Supported Algorithms

Requirements and Usage

Requirements

Data

Data preperation

Non-IID Data Generation

Usage

Example runtime comparison of multi-process pipeline

Citation

Contributors

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages