Giter VIP home page Giter VIP logo

xerm's Introduction

[AAAI22 Oral] Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification

We point out the overlooked unbiasedness in long-tailed classification: models should perform well on both imbalanced and balanced test distributions. To tackle this challenge, we propose a novel training paradigm called Cross-Domain Empirical Risk Minimization (xERM). xERM consists of two ERM terms, which are the cross-entropy losses with different supervisions: one is the ground-truth label from the seen imbalanced domain, and the other is the prediction of a balanced model. The imbalanced domain empirical risk encourages the model to learn from the ground-truth annotations from the imbalanced distribution, which favors the head classes. The balanced domain empirical risk encourages the model to learn from the prediction of a balanced model, which imagines a balanced distribution and prefers the tail classes. These two risks are weighted to take advantage of both, i.e., protect the model training from being neither too “head biased” nor too “tail biased”, and can achieve the unbiasedness.

The codes are organized into three folders:

  1. xERM.PC.public uses Post-Compensated Softmax (PC) as balanced model to implement xERM.
  2. xERM.TDE.public uses Total Direct Effect (TDE) as balanced model to implement xERM.
  3. xERM.RIDE.public uses RIDE as balanced model to implement xERM.

Citation

If you find our paper or this project helps your research, please kindly consider citing our paper in your publication.

@inproceedings{beierxERM,
  title={Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification},
  author={Zhu, Beier and Niu, Yulei and Hua, Xian-Sheng and Zhang, Hanwang},
  booktitle={AAAI Conference on Artificial Intelligence},
  year={2022}
}

Update Logs

  • 28 Dec 2021, Release train and eval code of xERM-PC on CIFAR100-LT and ImageNet-LT Dataset.
  • 30 Dec 2021, Release train and eval code of xERM-TDE on Places365-LT Dataset (ResNet50 Backbone).
  • 31 Dec 2021, Release train and eval code of xERM-TDE on Places365-LT Dataset (ResNet152 Backbone).
  • 23 Jan 2022, Release train and eval code of xERM-RIDE-PC-{2,3} on CIFAR100-LT-IB-100 Dataset.

xerm's People

Contributors

beierzhu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

xerm's Issues

xERM-TDE ImageNet Performance

Dear Beier,
Thank you for your innovative work and I am really interested in it. However, when training the ImageNet_LT dataset with xERM-TDE method, I got low performance only 14.4 in the overall accuracy. I would like to ask, is it convenient to share the ImageNet_LT training code or configs there to make sure we could get your sota results?

how to get a balanced model

Dear Beier:
I'm very interested in your work!In your code,I can get a imbalanced model by TDE, but I don't how do you get the balanced model ?

xERM-PC performance

Dear Beier,
I got a problem that I could not get your accuracy of 53.2 with xERM-PC code. My accuracy result is 0.527 with your config ce.yaml. The accuracies are :
- Imbalanced Acc 0.461 Balanced Acc 0.513 xERM Acc 0.527
Could you give some hints about how to reproduce your result?

Best

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.