Giter VIP home page Giter VIP logo

dacon's Introduction

👋   Hi! My name is Andrew.

Here's are some facts about me:

  • 💡   I'm a software engineer working in the San Francisco Bay Area.
  • 🎓   I graduated from Georgia Tech specializing in machine learning. Happy to connect with aspiring/current Yellow Jackets interested in the tech industry!
  • 🌱   I'm all up for building robust and cost-efficient ML models and delivering trustworthy AI solutions to production.
  • ⛹️‍♂️   In my free time, I enjoy working out, playing basketball and learning new things.
  • ✉️   Feel free to reach out to me on LinkedIn.

🛠   Tech Stack

Python  SQL  Java  C  JavaScript  HTML  CSS  Git  GitHub  PyCharm 

NumPy  Scipy  Pandas  Scikit-learn  OpenCv  NLTK  Spacy  Transformers  PyTorch  Tensorflow  Keras 

Django  MongoDB  FastAPI  Docker  Kubernetes  Google Cloud Platform  Tableau 

dacon's People

Contributors

achen353 avatar oi02lyl avatar

Watchers

 avatar

Forkers

boscoj2008

dacon's Issues

Use Gumbel-Softmax to Approximate Sampling of DA

In our design for the project, part of the Augmentation block was to have a trainable categorical distribution of DA operators from which sample during each iteration for each given data point x.

However, sampling a value from a distribution is differentiable. The common practice is to use Gumbel-Softmax to approximate the distribution such that it is differentiable and trainable through the backdrop.

Useful sources:

Screen Shot 2021-11-13 at 5 22 32 PM

Reweight CE losses

The current implementation puts equal weights on each CE term, giving too many weights to augmented data when the ratio of original data to augmented data is 1:7.

Screen Shot 2021-12-01 at 11 50 41 AM

The weights for these terms could be a tunable parameter; however, this could largely fall out of the scope of this project given the time constraints. One potential change we could make is to weight the CE losses as follows:

CE(M(x), y) + (1 / 7) * [CE(M(aug_x_1), y) + … + CE(M(aug_x_7), y)]

DACon Variants

Summary

Rough draft of the experiment settings:

Method Operator Name(s) Loss Function
DACon Baseline dacon_baseline CE(x, y_true) + CE(x_random_aug, y_true)
DACon One-to-Many dacon_one_to_many CE(x, y_true) + CE(x_aug_1, y_true) + ... + CE(x_aug_n, y_true)
DACon Consistency Loss (w/ fixed JS-divergence) dacon_fixed_consistency CE(x, y_true) + CE(x_aug_1, y_true) + ... + CE(x_aug_n, y_true) + fix JS divergence
DACon Consistency Loss (w/ weighted JS-divergence) dacon_consistency CE(x, y_true) + CE(x_aug_1, y_true) + ... + CE(x_aug_n, y_true) + weighted JS divergence

Add timer

To compare the runtime, we will need to add timer to the code and run through Rotom's experiments as well.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.