Giter VIP home page Giter VIP logo

deep-unsupervised-saliency-detection's Introduction

[WIP] Implmentation of the Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective

Requirements

  1. pytorch 1.0
  2. python 3.6
  3. numpy Dataset

  1. MSRA-B

Results

Table of Contents

  1. Training on Small Dataset(10 Images)
  2. Training on Large Dataset(1500 Images Training, 500 Images Validataion, 500 Images Test)

Training on Small Dataset(10 Images)

Experiment Performed for overfitting, checking if the model works, tested for all models, Only reporting for the full(Training Noise module)

Training on Large Dataset(1500 Images Training, 500 Images Validataion, 500 Images Test)

The most important part here is the scheduler, since we keep on training with the same leanring rate without, Image size is taken to be 256, Number of Epochs is 20, Early Stopping on validation loss, paitence of 5 epochs

  1. Exp-1 Using Adam Optimizer

    Exp Name Optimizer Batch Size LR Betas Momentum Scheduler Notes Decay Factor Paitence THRESHOLD MINLR COOLDOWN Recall-Test Precision-Test F1-Test MAE-TEST
    Real Adam 16 3e-4 (0.9, 0.99) x ReduceLROnPlateu Label is the ground truth 0.9 1 1e-4 1e-16 1 0.5023 0.9622 0.66 0.035
    Noise Adam 8 3e-4 (0.9, 0.99) x ReduceLROnPlateu Use all Noise labesl 0.9 1 1e-4 1e-16 1 0.793 0.946 0.862 0.028
    Avg Adam 16 3e-4 (0.9, 0.99) x ReduceLROnPlateu Use Avg of Noise Labels 0.9 1 1e-4 1e-16 1 0.802 0.907 0.851 0.041
    Full Adam 4 3e-4 (0.9, 0.99) x ReduceLROnPlateu Full Training 0.9 1 1e-4 1e-16 1 0.841 0.857 0.848 0.036

Sample Images img Average Exp Sample Maps img Average Exp Sample [email protected] img Real Exp Sample Maps img Real Exp Sample Maps [email protected] img Noise Exp Sample Maps img Noise Exp Sample Maps [email protected] img Full Exp Sample Maps img Full Exp Sample Maps [email protected] img

  1. Exp-2 Using SGD Optimizer

    Exp Name Optimizer Batch Size LR Betas Momentum Scheduler Notes Decay Factor Paitence THRESHOLD MINLR COOLDOWN Recall-Test Precision-Test F1-Test MAE-TEST
    Real SGD 16 1e-3 x 0.9 ReduceLROnPlateu Label is the ground truth 0.9 1 1e-4 1e-16 1 0.865 0.907 0.885 0.032
    Noise SGD 8 1e-3 x 0.9 ReduceLROnPlateu Use all Noise labesl 0.9 1 1e-4 1e-16 1 0.620 0.820 0.706 0.044
    Avg SGD 16 1e-3 x 0.9 ReduceLROnPlateu Use Avg of Noise Labels 0.9 1 1e-4 1e-16 1 0.934 0.639 0.758 0.058
    Full SGD 4 1e-3 x 0.9 ReduceLROnPlateu Full Training 0.9 1 1e-4 1e-16 1 0.825 0.795 0.809 0.027

Average Exp Sample Maps Average Exp Sample Maps Average Exp Sample [email protected] Average Exp Sample Threshold 0.5 Real Exp Sample Maps Real Exp Sample Maps Real Exp Sample Maps [email protected] Real Exp Sample Maps Threshold 0.5 Noise Exp Sample Maps Noise Exp Sample Maps Noise Exp Sample Maps [email protected] Noise Exp Sample Maps Thresold 0.5 Full Exp Sample Maps Full Exp Sample Maps Full Exp Sample Maps [email protected] Full Exp Sample Maps Thresold 0.5

deep-unsupervised-saliency-detection's People

Contributors

kris-singh avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

deep-unsupervised-saliency-detection's Issues

Trained model

Dear author, is it convenient for you to provide your trained model and testing script? Thank you very much.

how to get or create the data?

hi, thanks for you work. i tried your code, but came into with the error: No such file or directory: '/data/workspaces/krish/sd/msrab_hkuis/small.lst'.
so how to get or create the data? can you update part of some which can be directly used or a final official version?

Can you upload user guide ?

I would be very grateful if you could describe how to use your code. For example, what is the form of train.lst ? Thank you very much.

how to generate noisy saliency maps during training?

I have read your paper and your code, I have some questing to ask your help ?

  1. I'm confusing about the noisy saliency maps, I want to know how to generate noisy saliency maps during train.
  2. in your code, there is a sal_label in your dataloder.py, so whta's the sal_label?
  3. the method described in the paper is unsupervised, but in the train precedure, labels are used, so I'm really confusing.
    can you explain these above questions? thank you!

Model does not train well with noise module

Hi, I have a small dataset (300 training images). The model performs ok during round 1 of training, but when introducing the noise module, the prediction loss returns to where it started in round 1 and does not improve. The noise loss does not decrease over time either. Any ideas? I'm using two sources of noisy ground truth.

What is the meaning for "Exp Name FULL" ?

Thanks for your implmentation ! Recently, I am interested in the unsupervised learning, this implmentation can help me. Here, I don not know the meaning of "Exp Name FULL". Can you explain the Full Training?

What does the parameter NUM_MAPS mean

Thanks very much for your code.
But I am a little bit confused of the dimension of noisy_gt.
I use opencv to generate noisy ground truth and the dimension is (256,256),is that the true dimension?
And I noticed that the input image x is repeated by NUM_MAPS,I don't understand the meaning of it.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.