Comments (5)
Got nearly working code in this thread,
https://discuss.pytorch.org/t/continuous-action-a3c/1033
from pytorch-a3c.
Hi, any chance you could give me some advice? I'm still stuck trying to get this to work. Here's a gist of my code:
https://gist.github.com/AjayTalati/184fec867380f6fa22b9aa0951143dec
I keep getting this error,
File "main_single.py", line 174, in <module>
value_loss = value_loss + advantage.pow(2)
AttributeError: 'numpy.ndarray' object has no attribute 'pow'
I don't understand why advantage has become a numpy array instead of a torch.Tensor. It never occurred with the discrete-action implementation.
Any ideas what I've got wrong?
Thanks a lot for your help,
Best,
Ajay
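One possible cause, offered only as a guess since the gist is not reproduced here: many continuous-control environments return the reward as a numpy array rather than a Python float, and mixing that array into the return accumulation can leave advantage as a numpy.ndarray, which has no pow method. The sketch below converts each reward to a tensor before it is used. The names to_tensor and value_loss_from_rollout are hypothetical and not taken from pytorch-a3c or the gist, and the example rewards are made up for illustration.

```python
import numpy as np
import torch


def to_tensor(x):
    """Convert a Python scalar, numpy array, or tensor to a float32 torch tensor."""
    return torch.as_tensor(x, dtype=torch.float32)


def value_loss_from_rollout(rewards, values, gamma=0.99):
    """Accumulate squared advantages over a rollout, walking it backwards.

    rewards: list of per-step rewards (floats or numpy arrays, assumed shape (1,))
    values:  list of per-step value estimates as torch tensors of shape (1,)
    """
    R = torch.zeros(1)           # bootstrap return; zero assumes a terminal state
    value_loss = torch.zeros(1)
    for reward, value in zip(reversed(rewards), reversed(values)):
        # Converting the reward first keeps R, and therefore advantage, a tensor.
        R = gamma * R + to_tensor(reward).reshape(1)
        advantage = R - value
        value_loss = value_loss + advantage.pow(2)
    return value_loss


# Hypothetical rollout: rewards arrive as numpy arrays, values as tensors.
rewards = [np.array([-1.0]), np.array([-0.5]), np.array([0.0])]
values = [torch.zeros(1, requires_grad=True) for _ in rewards]
loss = value_loss_from_rollout(rewards, values, gamma=0.5)
```

If the conversion is the issue, printing type(advantage) just before line 174 of main_single.py should show which operand introduced the numpy array.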
from pytorch-a3c.
Closing this, as continuous functions are just a pain to approximate.
from pytorch-a3c.
I will add continuous control later. I don't have time at the moment.
from pytorch-a3c.
OK, cool, take your time. I don't mean this as an A3C-specific comment, or anything specific about your implementation.
It's just a general observation (and perhaps a provable fact) that I've found discrete functions easier to approximate than continuous ones.
In terms of simple MLP theory, this paper by Mhaskar and Poggio is nice:
Learning Real and Boolean Functions: When Is Deep Better Than Shallow
from pytorch-a3c.