Giter VIP home page Giter VIP logo

Comments (4)

Kaixhin avatar Kaixhin commented on August 21, 2024

I don't know of a mathematical proof, but it seems to work well in practice, and I've not seen a normalisation step in other code I've looked at. An issue with normalisation is that you need to choose a method to use, whilst without you can directly observe changes in intensity over time or between models on the same domain.

from atari.

happywu avatar happywu commented on August 21, 2024

Thanks!!!
One more thing confuses me is whether to take the positive part or the absolute value of the gradients.

I've tried the two both and found that using abs value could result some noisy part while the positive value is more clear. However, the Dueling DQN paper uses absolute value. If positive value of the gradients indicate that part is important for the agent to make decisions, and zero means the agent does not care that part, then what negative value might indicate?

Many thanks.

from atari.

Kaixhin avatar Kaixhin commented on August 21, 2024

I did not realise that they updated the paper with details of the saliency - but it doesn't appear that they use normalisation either.

The absolute value makes sense as the magnitude is important. Positive values indicate areas that should be increased (if you could change the image as if it were a weight matrix) to increase the error, whilst negative areas indicate areas that should be decreased to increase the error. Hence doing the opposite of this (gradient descent) decreases the error in neural network training.

from atari.

happywu avatar happywu commented on August 21, 2024

Thanks!!! It is really helpful for me!!!

from atari.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.