Giter VIP home page Giter VIP logo

feddane's Introduction

FedDANE: A Federated Newton-Type Method

This repository contains the code and experiments for the paper:

FedDANE: A Federated Newton-Type Method

Asilomar Conference on Signals, Systems, and Computers 2019 (Invited Paper)

FedDANE is an optimization method that we adapt from DANE, a method for classical distributed optimization, to handle the practical constraints of federated learning. We provide convergence guarantees for this method when learning over both convex and non-convex functions. Despite encouraging theoretical results, we find that the method has underwhelming performance empirically. We identify low device participation and statistical device heterogeneity as two underlying causes of this underwhelming performance. In the paper, we also suggest several directions of future work.

This repository contains a set of empirical evaluation on both synthetic and real-world datasets. FedDANE consistently underperforms baselines of FedAvg and FedProx in realistic federated settings.

Usage

The usage is almost the same as that of the FedProx code, except that we are using a different optimizer specified in flearn/optimizer/pggd.py. For example, command lines to reproduce the results on the synthetic IID data are:

mkdir log_synthetic
bash run_trainer.sh synthetic_iid fedavg 0 | tee log_synthetic/iid_fedavg_c10_e20
bash run_trainer.sh synthetic_iid fedprox 1 | tee log_synthetic/iid_fedprox_c10_e20
bash run_trainer.sh synthetic_iid feddane 0 | tee log_synthetic/iid_feddane_c10_e20
python plot.py

References

See our FedDANE paper for more details as well as all references.

feddane's People

Contributors

litian96 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

feddane's Issues

Why not to select another subset S'

Thank you so much for your efforts (papers + code).
There are few pieces I did not understand. Would you please help!

I noticed that in the FedDANE paper, it is mentioned in the algorithm that we first need to choose a subset S to compute the gradients, then we need to choose another subset S' to run the actual training (update clients weights). However, in your code I noticed that in FedDANE trainer you are passing the same seed:
selected_clients = self.select_clients(i, num_clients=self.clients_per_round)
line number 28 and 39 in FedDANE trainer. So you are choosing the same subset again not another subset S'.

Q2: In the algorithm, it is mentioned that the averaging will take place over the subset S not S' that we actually trained. So I was wondering is that a typo? If not, then would you please explain why we need to train S' then average another set S ?

Q3: When we run the first training loop to average the gradients, then we train for one epoch only, right? Since adding more than one epoch, will overwrite the gradients.

Q4: Finally, I believe in your code you assumed none of the devices will drop, is that correct?

Thank you so much for your time!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.