Giter VIP home page Giter VIP logo

kennethleungty / data-centric-ai-competition Goto Github PK

View Code? Open in Web Editor NEW
20.0 2.0 3.0 11.66 MB

Codes for a Top 5% finish in the Data-Centric AI Competition organized by Andrew Ng and DeepLearning.AI

Home Page: https://towardsdatascience.com/data-centric-ai-competition-tips-and-tricks-of-a-top-5-finish-9cacc254626e

Python 0.25% Jupyter Notebook 99.75%
deep-learning andrew-ng machine-learning data-science data-centric data-centric-ai ai

data-centric-ai-competition's Introduction

Data-Centric AI Competition 2021 - Tips and Tricks of a Top 5% Finish

Sharing the techniques that worked (and did not work) in the competition organized by Andrew Ng & DeepLearning.AI

Link to Medium writeup: https://towardsdatascience.com/data-centric-ai-competition-tips-and-tricks-of-a-top-5-finish-9cacc254626e


Introduction

Data is food for AI, and there is vast potential for model performance improvement by shifting from a model-centric to a data-centric approach. That is the motivation behind the recent Data-Centric AI Competition organized by Andrew Ng and DeepLearning.AI.

In this repo, I unveil the methods (and codes) of my Top 5% ranked submission (~84% accuracy, ranked 24), including the various techniques that worked and did not work for me. Do check out the Medium article for a more in-depth look at my thought process and methods behind the submission.


About the Competition

  • Link to competition page: https://https-deeplearning-ai.github.io/data-centric-comp/
  • A collaboration between DeepLearning.AI and Landing AI, the Data-Centric AI Competition aims to elevate data-centric approaches to improving the performance of machine learning models.
  • In most machine learning competitions, you are asked to build a high-performance model given a fixed dataset.
  • However, machine learning has matured to the point that high-performance model architectures are widely available, while approaches to engineering datasets have lagged.
  • The Data-Centric AI Competition inverts the traditional format and instead asks you to improve a dataset given a fixed model. We will provide you with a dataset to improve by applying data-centric techniques such as fixing incorrect labels, adding examples that represent edge cases, apply data augmentation, etc.

Contents

  • Full_Notebook_Best_Submission.ipynb (Complete walkthrough codes for the best submission I submitted for the competition)
  • experiment_tracker.csv (Spreadsheet tracker I used to monitor my various experiments)
  • /data (Public Roman MNIST dataset released by the competition)

data-centric-ai-competition's People

Contributors

kennethleungty avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.