Comments (8)
Do your package versions match the ones listed in the README?
from maac.
My PyTorch is 0.4.0 and Python is 3.6.1. The rest of the requirements were installed through pip. I also suspect there is a version problem. Could it be that my gym is the latest version, since I installed it directly with pip?
Yeah, if you run pip install gym, it will install the most recent version by default. Try this instead: pip install gym==0.9.4
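One way to confirm which version pip actually installed is to compare it against the pin from Python. The following is a minimal sketch, not part of the repository: the helper names installed_version and find_mismatches are hypothetical, and the only pin taken from this thread is gym 0.9.4. It falls back to pkg_resources on interpreters older than Python 3.8, where importlib.metadata is not available.

```python
try:
    # Available in the standard library from Python 3.8 onwards.
    from importlib.metadata import version, PackageNotFoundError
except ImportError:
    # Older interpreters (such as Python 3.6) use the pkg_resources fallback below.
    version = None


def installed_version(package):
    """Return the installed version string of `package`, or None if it is missing.

    Hypothetical helper written for this example.
    """
    if version is not None:
        try:
            return version(package)
        except PackageNotFoundError:
            return None
    import pkg_resources  # shipped with setuptools
    try:
        return pkg_resources.get_distribution(package).version
    except pkg_resources.DistributionNotFound:
        return None


def find_mismatches(pins):
    """Map each package whose installed version differs from its pin
    to a tuple of (expected, found)."""
    mismatches = {}
    for package, expected in pins.items():
        found = installed_version(package)
        if found != expected:
            mismatches[package] = (expected, found)
    return mismatches


if __name__ == "__main__":
    # Only the gym pin comes from this thread; add others from the README as needed.
    pins = {"gym": "0.9.4"}
    for package, (expected, found) in find_mismatches(pins).items():
        print(f"{package}: expected {expected}, found {found}")
```

If the script prints nothing, the installed versions match the pins you listed.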
I've never tested simple_crypto.py, so I am not sure what the problem is there. It's possible that I broke something in that environment when making the modifications that I did.
What parts of main.py do you not understand? You need to pass in the name of the environment and a name for your saved model; all other parameters are optional. For example, this would train on the multi_speaker_listener environment for 25000 episodes and save the model under the name "test" (with all other parameters at their defaults).
python main.py multi_speaker_listener test --n_episodes 25000
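For readers unfamiliar with this command-line shape, here is a rough sketch of how two required positional arguments and an optional flag can be declared with argparse. It is an illustration only and is not copied from main.py: the argument names env_id and model_name and the default of 50000 are assumptions. Only the --n_episodes flag and the example values come from the command above.

```python
import argparse


def build_parser():
    """Build a parser with two required positionals and one optional flag."""
    parser = argparse.ArgumentParser(description="Illustrative training CLI")
    # Positional arguments are required; these names are assumed for the example.
    parser.add_argument("env_id", help="name of the environment to train on")
    parser.add_argument("model_name", help="name under which the model is saved")
    # Optional flag; the default value here is a placeholder, not the real default.
    parser.add_argument("--n_episodes", type=int, default=50000)
    return parser


# Parse the same arguments as the command shown above.
args = build_parser().parse_args(
    ["multi_speaker_listener", "test", "--n_episodes", "25000"]
)
print(args.env_id, args.model_name, args.n_episodes)
```

Leaving out --n_episodes would fall back to the parser's default, which is what "all other parameters are optional" means in practice.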
I ran the command you gave me, and the PyCharm console prints lines of the form "Episodes number-number of number". A multi_speaker_listener folder also appears in the models folder.
It sounds like you have it running properly!
Yeah, it doesn't render the environment during training, since that would slow things down significantly. After training, you can load the saved model (I provide a method on the model class to load from a saved file) and render the environment to see how it's doing.
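The split between training without rendering and evaluation with rendering can be sketched as a single rollout function with a render switch. This is a self-contained illustration under stated assumptions: StubEnv and run_episode are hypothetical stand-ins, the environment is single-agent for brevity, and the policy is a plain function rather than a model loaded from a saved file. In practice you would build the real environment and obtain the policy from the model's load method.

```python
class StubEnv:
    """Stand-in environment with the classic gym reset/step/render interface."""

    def __init__(self, episode_length=5):
        self.episode_length = episode_length
        self.t = 0
        self.render_calls = 0  # counts how often render() was invoked

    def reset(self):
        self.t = 0
        return 0.0

    def step(self, action):
        self.t += 1
        obs = float(self.t)
        reward = 1.0 if action == 1 else 0.0
        done = self.t >= self.episode_length
        return obs, reward, done, {}

    def render(self):
        # A real environment would draw a frame here.
        self.render_calls += 1


def run_episode(env, policy, max_steps=25, render=False):
    """Roll out one episode and return its total reward.

    Rendering is off by default so that training-style rollouts stay fast.
    """
    obs = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        if render:
            env.render()
        obs, reward, done, _ = env.step(policy(obs))
        total_reward += reward
        if done:
            break
    return total_reward


env = StubEnv()
always_one = lambda obs: 1  # placeholder for a policy loaded from a saved model

training_return = run_episode(env, always_one, render=False)  # no frames drawn
eval_return = run_episode(env, always_one, render=True)       # one frame per step
print(training_return, eval_return, env.render_calls)
```

Keeping the render call behind a flag means the same rollout code serves both purposes, and the cost of drawing frames is only paid when you inspect a trained model.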
I am going to go ahead and close this issue, since issues are generally created for reporting bugs in the code. My advice to you would be to read the code carefully, and it should answer most of your questions.