Comments (2)
Hi, apologies for the delay in responding.
You are correct that the scores are not actually normalized in the sense of adding up to 1. We are in fact scaling the "normalized" values by the number of neurons in the layer.
This may have been an oversight on our end, and we will add a clarifying note to the paper and to this code. However, given that our experiments were run with this setup, we will keep the code as is!
If you decide to correct this and emit properly normalized scores, you will likely have to adjust the threshold as well. Do let us know if you find anything interesting!
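To make the scaling concrete, here is a minimal sketch, not the dopamine implementation. It assumes each neuron's score is its mean absolute activation divided by the layer average, which is what makes the scores sum to the number of neurons rather than to 1. The function names, the activation values, and the threshold `tau` are hypothetical.

```python
EPS = 1e-9  # guards against division by zero when a whole layer is silent


def mean_normalized_scores(mean_abs_activations):
    """Divide each neuron's mean |activation| by the layer average.

    The resulting scores sum to (approximately) the number of neurons.
    """
    n = len(mean_abs_activations)
    layer_mean = sum(mean_abs_activations) / n
    return [a / (layer_mean + EPS) for a in mean_abs_activations]


def sum_normalized_scores(mean_abs_activations):
    """Divide each neuron's mean |activation| by the layer total.

    The resulting scores sum to (approximately) 1.
    """
    total = sum(mean_abs_activations)
    return [a / (total + EPS) for a in mean_abs_activations]


# Hypothetical per-neuron mean |activation| for a layer of 4 neurons.
acts = [0.0, 0.5, 1.5, 2.0]
tau = 0.1  # hypothetical dormancy threshold for the mean-normalized scores

mean_scores = mean_normalized_scores(acts)
sum_scores = sum_normalized_scores(acts)

# Sum-normalized scores are smaller by a factor of n, so the threshold
# has to shrink by the same factor to flag the same neurons.
dormant_mean = [s <= tau for s in mean_scores]
dormant_sum = [s <= tau / len(acts) for s in sum_scores]

print(sum(mean_scores), sum(sum_scores))
print(dormant_mean, dormant_sum)
```

Within a single layer the two conventions differ only by the factor `n`, so dividing the threshold by the layer width reproduces the same dormant set. With layers of different widths, a single global threshold would no longer be equivalent, which is one reason the threshold may need retuning.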
from dopamine.
Hi @yycho0108 @psc-g, I have a question about ReDo. The paper says to reinitialize the dormant neurons' incoming weights and zero out their outgoing weights. I'm confused because, in my mind, each layer of the network is just a matrix, so I'm not sure what the incoming and outgoing weights are.
Could you give me some hints?
Thanks a lot!
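One way to read "incoming" and "outgoing" in matrix terms, sketched below under stated assumptions rather than taken from the dopamine code: for a hidden layer computed as `h = relu(x @ w_in)` followed by `y = h @ w_out`, hidden neuron `j` owns column `j` of `w_in` (its incoming weights) and row `j` of `w_out` (its outgoing weights). The helper `recycle_neuron`, the uniform re-initialization, and its `scale` are hypothetical stand-ins for whatever initializer the layer originally used. Biases are left untouched here.

```python
import random


def recycle_neuron(w_in, w_out, j, rng, scale=0.1):
    """Reset hidden neuron j of a two-layer MLP, in place.

    w_in  has shape [n_inputs][n_hidden]:  column j feeds INTO neuron j.
    w_out has shape [n_hidden][n_outputs]: row j carries neuron j's output onward.
    """
    # Incoming weights: one entry per input, all in column j of w_in.
    for row in w_in:
        row[j] = rng.uniform(-scale, scale)  # hypothetical initializer
    # Outgoing weights: the whole of row j in w_out, set to zero so the
    # freshly re-initialized neuron does not change the network's output.
    w_out[j] = [0.0] * len(w_out[j])


rng = random.Random(0)
w_in = [[1.0, 2.0, 3.0],
        [4.0, 5.0, 6.0]]      # 2 inputs -> 3 hidden neurons
w_out = [[1.0, 1.0],
         [2.0, 2.0],
         [3.0, 3.0]]          # 3 hidden neurons -> 2 outputs

recycle_neuron(w_in, w_out, j=1, rng=rng)
print(w_in)   # only the middle column has changed
print(w_out)  # only the middle row is zero
```

So although each layer is a single matrix, one neuron touches two matrices: a slice of the layer that produces it and a slice of the layer that consumes it. For convolutional layers the same idea applies per output channel (incoming) and per input channel of the next layer (outgoing).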