Comments (4)
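- `ensemble_num` is the number of models trained at the same time (K Actors and K Critics). The K models share their bottom-layer parameters (multi-head style).
- At prediction time, each of the K Critics scores the outputs of all K Actors, and each Critic normalizes its own scores. The final score of an Actor's output is the sum of its normalized Critic scores, and we select the Actor output with the highest final score. For the details, refer to the source code.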
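The selection rule described above can be sketched in plain Python. This is only an illustration, not the PARL source: the names `ensemble_predict`, `normalize`, `actors`, and `critics` are hypothetical, and min-max scaling is an assumed normalization, since the thread does not say which scheme each Critic uses.

```python
from typing import Callable, List, Sequence

Action = List[float]
Actor = Callable[[Sequence[float]], Action]
Critic = Callable[[Sequence[float], Action], float]


def normalize(scores: List[float]) -> List[float]:
    """Min-max scale one Critic's scores to [0, 1] (assumed scheme)."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # This Critic cannot tell the candidates apart, so it contributes nothing.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def ensemble_predict(obs: Sequence[float],
                     actors: List[Actor],
                     critics: List[Critic]) -> Action:
    """Return the candidate action with the highest summed normalized score."""
    # Each of the K Actors proposes one candidate action.
    candidates = [actor(obs) for actor in actors]
    totals = [0.0] * len(candidates)
    for critic in critics:
        # Each Critic scores every candidate, then normalizes its own scores.
        raw = [critic(obs, action) for action in candidates]
        for i, score in enumerate(normalize(raw)):
            totals[i] += score
    best = max(range(len(candidates)), key=totals.__getitem__)
    return candidates[best]


# Toy usage: three Actors propose fixed actions, two Critics score them.
toy_actors = [lambda obs: [0.0], lambda obs: [1.0], lambda obs: [2.0]]
toy_critics = [
    lambda obs, a: -abs(a[0] - 1.0),  # prefers actions close to 1.0
    lambda obs, a: 10.0 * a[0],       # prefers larger actions
]
chosen = ensemble_predict([0.0], toy_actors, toy_critics)
```

Normalizing per Critic before summing keeps a Critic with a larger output scale from dominating the vote, which is presumably why the scores are not summed raw.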
I see that `define_ensemble_predict(self, obs)` is called in `build_program`, but I can't find where `build_program` itself is used.
`build_program` is in PARL/parl/framework/agent_base.py. Is it called directly when the model runs?
PARL/parl/framework/agent_base.py
Line 46 in 348db1f
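Yes, this function is called automatically when the agent is constructed.
It seems the way this function gets called is not easy to track down. We will update the documentation to explain it.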
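The pattern behind that answer can be sketched as follows. This is a minimal illustration, not the contents of agent_base.py: the class names `AgentBase` and `MyAgent` and the attribute `program_built` are hypothetical. The only thing taken from the thread is that the base class invokes `build_program` during construction, which is why no explicit call shows up in user code.

```python
class AgentBase:
    """Hypothetical base class: the constructor triggers build_program."""

    def __init__(self, algorithm):
        self.alg = algorithm
        # Called automatically on construction; subclasses never call it themselves.
        self.build_program()

    def build_program(self):
        raise NotImplementedError


class MyAgent(AgentBase):
    """Hypothetical subclass that only overrides build_program."""

    def build_program(self):
        # A real agent would define its prediction and learning programs here.
        self.program_built = True


agent = MyAgent(algorithm=None)
```

Searching a subclass for calls to `build_program` finds nothing, because the only call site is in the base class constructor.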