Giter VIP home page Giter VIP logo

deep-learning-group's Introduction

About Me

I'm a Sr. Data Engineer at Disney Streaming.

Personal Projects

  • My Master's Thesis on text-independent Speaker verification on microcontrollers.
  • A long-term (super WIP) attempt to write a pure Rust chess engine, lasker.
  • torchmnist, a simple Android app to draw hand written digits and predict them using PyTorch.
  • Finally, my attempt at self-teaching system design and other foundational principles, given that I didn't study Computer Science in college.

Open Source

deep-learning-group's People

Contributors

greenberries avatar jiffar1 avatar samsam2610 avatar tduffy000 avatar tunghoangt avatar

Watchers

 avatar  avatar

deep-learning-group's Issues

Dataset issue

From this confusion matrix,
Test_2_Confusion_Matrix
There are some confusions between classes 7 and 8. Almost all the other classes, the error seems to be randomly distributed.

Update the README

We should update the top-level README to reflect the final structure and contents of this repo.

Tony's tasks

  • Create a 'subset' dataset with separate train & test folders (10 classes)
  • Establish a working pipeline on Google Collab
  • Train, validate, and evaluate a simple CNN bare model on Google Collab
  • Testing additional model architecture
  • Since I have access to Google Collab Pro, testing models with bigger dataset
  • Summarize initial results with 4 models
  • Review Tom's transfer learning models
  • Review Sam's journals
  • Organize results into team's final presentation

More rich EDA

We, currently, don't have any notebooks or other code fully dedicated to exploratory data analysis, which is a crucial task. We should take one of the notebooks and modify it such that it simply serves as exploring the source dataset and providing the reader with examples and distributions.

Sam's tasks

Here are the tasks that I am aiming to complete before Wednesday:

  • Select "some" labels from dataset to train for the project instead of using them all
  • Train, validate, and evaluate a simple CNN bare model. Plot the val_accuracy and accuracy
  • Incorporate callbacks checkpoints to terminate early and to save best model and training history
  • Do the same thing, but with VGG16 imagenet using transfer learning.
  • Compare the result between the two simple model and optimize the hyper-parameters so they can be used as baseline.
  • Create checkpoint to save and load the best model

Tom's tasks

  • Repo clean-up
  • Benchmark model (e.g. random choice)
  • Class prior probability analysis

General hyper-parameters and keras function

Hi guys, here are some issues I ran into and am working on fixing right now

  • The original model.fit function is too slow to train the model. I'm looking for an alternative. Using this function takes an abysmal amount of time per epoch.
  • The default value for steps_per_epoch is too low which causes longer training time. I'm searching for optimal value.
  • Should we limit the number of validation steps per epoch using validation_steps? Right now, the validation_split is 0.2

Trimmed dataset

Hi guys,

Here is the dropbox link to the trimmed dataset. I select the 17 classes that have the most images. Feel free to suggest me to add or remove any classes. I just want to make that we all use the same trimmed dataset.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.