Giter VIP home page Giter VIP logo

light_mappo's Issues

训练效果查看

请问训练结束后,得到logs 和 models 怎么查看和使用?log
image
s使用tensoboard进行查看吗, 怎么来加载模型测试来查看效果呢?

加入自己的环境,使用env_continuous时碰到的问题

在自己修改代码后,选择的是continuous env,智能体separated policy更新action,但是env_runner.py中的 collect 函数这里只有MultiDiscrete 和Discrete两个选项,没有Box选项,请问这个情况要怎么处理?感谢!

image

一次回合结束时重置环境导致obs发生变化

在env_wrappers.py中,step_wait()的"obs[i] = self.envs[i].reset()"判断episode是否结束,这里将reset之后的观测值传给了obs[i],导致episode结束的那一刻的obs被覆盖。这样赋值是否不妥?因为reset之后的obs可以认为是随机的,不应该将其赋给obs[i],而应该直接调用"self.envs[i].reset()"?

env

給出的范例只有 sub-agnet_obs ,这里是不是没有特别区分观测信息与全局状态信息? 这里的 sub_agent_obs 就是指智能体的部分观测信息的列表吗?那这样全局信息是怎么处理的呢,就是把部分观测信息的融合作为全局信息?

选use_eval的时候运行报错NotImplementedError

是不是连续动作空间的环境不能用eval

Traceback (most recent call last):
File "G:\lcz\mappo\train\train.py", line 149, in
main(sys.argv[1:])
File "G:\lcz\mappo\train\train.py", line 137, in main
runner.run()
File "G:\lcz\mappo\runner\shared\env_runner.py", line 88, in run
self.eval(total_num_steps)
File "C:\Users\ljh99\anaconda3\lib\site-packages\torch\autograd\grad_mode.py", line 27, in decorate_context
return func(*args, **kwargs)
File "G:\lcz\mappo\runner\shared\env_runner.py", line 183, in eval
raise NotImplementedError
NotImplementedError

回合结束后没有reset环境

RT,runner在训练的时候,如果环境给出了done,runner没有进行reset,在某些环境中可能会导致不收敛

Action mask?

您好!如果agent的动作维度不一致时,light-mappo如何进行action mask?

share_policy置False时出错

报错定位于runner/separated/env_runner.py中的collcet函数中的
actions = np.array(actions).transpose(1, 0, 2)
初步排查发现在这句代码上面的循环中,当agent_id取1时,生成的动作的shape与agent_id取0时不同

VecEnvWrapper使用

There is the mistake in the env_wrappers.py that is the VecEnvWrappercan is the unresolved reference.

MAPPO-L

Thanks very much for your codes.

Have you considered to extend it into other variants of MAPPO, such as MAPPO-L?

How to set continuous action

I want to use continuous actions, but an error is reported after setting the self.discrete_action_space in the environment to false.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.