feddane's Introduction

FedDANE: A Federated Newton-Type Method

This repository contains the code and experiments for the paper:

FedDANE: A Federated Newton-Type Method

Asilomar Conference on Signals, Systems, and Computers 2019 (Invited Paper)

FedDANE is an optimization method that we adapt from DANE, a method for classical distributed optimization, to handle the practical constraints of federated learning. We provide convergence guarantees for this method when learning over both convex and non-convex functions. Despite encouraging theoretical results, we find that the method has underwhelming performance empirically. We identify low device participation and statistical device heterogeneity as two underlying causes of this underwhelming performance. In the paper, we also suggest several directions of future work.

This repository contains a set of empirical evaluation on both synthetic and real-world datasets. FedDANE consistently underperforms baselines of FedAvg and FedProx in realistic federated settings.

Usage

The usage is almost the same as that of the FedProx code, except that we are using a different optimizer specified in flearn/optimizer/pggd.py. For example, command lines to reproduce the results on the synthetic IID data are:

mkdir log_synthetic
bash run_trainer.sh synthetic_iid fedavg 0 | tee log_synthetic/iid_fedavg_c10_e20
bash run_trainer.sh synthetic_iid fedprox 1 | tee log_synthetic/iid_fedprox_c10_e20
bash run_trainer.sh synthetic_iid feddane 0 | tee log_synthetic/iid_feddane_c10_e20
python plot.py

References

See our FedDANE paper for more details as well as all references.

feddane's People

Contributors

Stargazers

Watchers

feddane's Issues

Why not to select another subset S'

Thank you so much for your efforts (papers + code).
There are few pieces I did not understand. Would you please help!

I noticed that in the FedDANE paper, it is mentioned in the algorithm that we first need to choose a subset S to compute the gradients, then we need to choose another subset S' to run the actual training (update clients weights). However, in your code I noticed that in FedDANE trainer you are passing the same seed:
selected_clients = self.select_clients(i, num_clients=self.clients_per_round)
line number 28 and 39 in FedDANE trainer. So you are choosing the same subset again not another subset S'.

Q2: In the algorithm, it is mentioned that the averaging will take place over the subset S not S' that we actually trained. So I was wondering is that a typo? If not, then would you please explain why we need to train S' then average another set S ?

Q3: When we run the first training loop to average the gradients, then we train for one epoch only, right? Since adding more than one epoch, will overwrite the gradients.

Q4: Finally, I believe in your code you assumed none of the devices will drop, is that correct?

Thank you so much for your time!

Recommend Projects

litian96 / feddane Goto Github PK

feddane's Introduction

FedDANE: A Federated Newton-Type Method

Usage

References

feddane's People

Contributors

Stargazers

Watchers

Forkers

feddane's Issues

Why not to select another subset S'

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent