Comments (4)
It is doable. However, MARLlib does not incorporate loading and rendering functions, because we found it hard to unify rendering across all ten environments into a similar pattern.
We would like to provide you with instructions on how to implement this.
You can find an example of rendering here.
Also, the way to load a checkpoint is to add a restore path: tune.run(restore=YourCheckPointPath)
The complete configuration can be found in Trainer.
There is a thorough solution provided by Sven: multiagent-load-only-one-policy-from-checkpoint.
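As a rough illustration of the restore path mentioned above, the sketch below builds the keyword arguments and forwards them to tune.run. It assumes a Ray version in which tune.run still accepts the restore argument; restore_kwargs, run_restored, and the checkpoint path in the usage note are hypothetical names chosen for this example, not part of MARLlib.

```python
def restore_kwargs(checkpoint_path):
    """Build the extra keyword arguments that make tune.run resume from a checkpoint.

    Hypothetical helper for illustration only; it is not part of MARLlib or Ray.
    """
    if not checkpoint_path:
        raise ValueError("checkpoint_path must be a non-empty path")
    # `restore` is the tune.run argument named in the comment above.
    return {"restore": checkpoint_path}


def run_restored(trainable, config, checkpoint_path):
    """Call tune.run with the same trainable and config used for training."""
    # Imported lazily so restore_kwargs can be used without Ray installed.
    from ray import tune

    return tune.run(trainable, config=config, **restore_kwargs(checkpoint_path))
```

A call might look like run_restored(MyTrainer, my_config, "path/to/checkpoint"), where all three arguments are placeholders for whatever the original training run used.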
Any further questions are welcome. We are happy to help you out.
from marllib.
Hi, thanks for the previous answer. However, we have been unable to use the learned policy to compute actions for our agents. Our objective is to compute each agent's actions from the learned policy, but when we call agent.compute_single_action(obs), where obs = env.reset(),
we get an error saying that seq_lens is of type None. We cannot work out where to obtain the sequence lengths needed to resolve this error. We added some print statements to the training code and could see the sequence lengths being printed there, but compute_single_action does not seem to work with them.
It would be really helpful if you could provide some insight into this. Please also let us know whether we should be using a function other than agent.compute_single_action for this purpose.
Hi SiddSS,
Sorry for the late reply. Check out mamujoco example and mpe example for loading the checkpoint and rendering the environment in MARLlib.
Any further questions are welcome.
Siyi
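One possible cause of the seq_lens error reported above, assuming the trained model is recurrent (for example GRU or LSTM based), is that compute_single_action was called without a recurrent state. In the RLlib versions we are aware of, the sequence lengths are only filled in when a state is passed. The sketch below threads the state through successive calls; act_with_state is a hypothetical helper, and the policy ID and per-agent observation format depend on your setup.

```python
def act_with_state(agent, policy_id, obs, state=None):
    """Compute one action and return it with the next recurrent state.

    Hypothetical helper: `agent` is assumed to be an RLlib trainer exposing
    get_policy() and compute_single_action().
    """
    if state is None:
        # At the start of an episode, ask the policy for its initial state.
        # Non-recurrent policies return an empty list here.
        state = agent.get_policy(policy_id).get_initial_state()
    if not state:
        # Feed-forward policy: the call returns only the action.
        action = agent.compute_single_action(obs, policy_id=policy_id)
        return action, state
    # Recurrent policy: with a state passed in, the call returns
    # (action, next_state, extra_info).
    action, next_state, _ = agent.compute_single_action(
        obs, state=state, policy_id=policy_id
    )
    return action, next_state
```

In a multi-agent environment, env.reset() usually returns one observation per agent, so the helper would be called once per agent with that agent's observation and its own state, rather than with the whole reset output.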
New APIs that show how to render a pretrained model are available here.