cun-bjy / policy-distillation-baselines

PyTorch implementation of Policy Distillation for control, with well-trained teachers provided via stable_baselines3.

License: GNU General Public License v3.0

Python 100.00%

policy-distillation pytorch reinforcement-learning rl stable-baselines transfer-learning

policy-distillation-baselines's Introduction

Junyeob Baek

policy-distillation-baselines's People

policy-distillation xinqiangyu peter9697 xiaoyangyang2 clementine5 cpthoang

policy-distillation-baselines's Issues

more precise requirement

requirement.txt doesn't pin exact versions for some Python packages. Some of these packages have since been updated, so the project no longer runs. Could you please update requirement.txt? Thank you!
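
For anyone with a working environment, one way to produce pinned entries is sketched below. This is only an illustration, not the repository's tooling: the helper name `pin_requirements` and the package list are hypothetical, and the actual contents of requirement.txt are not shown here. It uses the standard-library `importlib.metadata` to look up whatever versions are installed locally.

```python
from importlib import metadata


def pin_requirements(names):
    """Return 'name==version' lines for installed packages.

    Packages that are not installed in the current environment
    are returned unpinned (bare name).
    """
    lines = []
    for name in names:
        try:
            lines.append(f"{name}=={metadata.version(name)}")
        except metadata.PackageNotFoundError:
            # Not installed here, so there is no version to pin
            lines.append(name)
    return lines


if __name__ == "__main__":
    # Hypothetical package list, NOT taken from the repository's requirement.txt
    for line in pin_requirements(["torch", "stable-baselines3", "gym"]):
        print(line)
```

Running `pip freeze` in the working environment gives a similar result for every installed package at once.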

history visualization w/ render

About multi-task reinforcement learning.

Thank you for your work on single-task reinforcement learning and transfer learning!
However, I think the most meaningful part of the paper is transfer learning for multi-task reinforcement learning.
If the teacher models are trained in different environments (for example, one in CartPole-v1 and another in Acrobot-v1), their state and action spaces are completely different, so the input and output layers of the student model need careful design.

"About 90% of parameters are shared, with only 3 small MLP “controllers” on top which are task specific and allow for different action sets between different games." This is from the paper, but I don't know the details.
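
A minimal PyTorch sketch of one possible layout follows, assuming a shared trunk with a small task-specific output head per environment, as the quoted sentence describes. The per-task input encoders are an extra assumption added here because CartPole-v1 (4-dimensional observation, 2 actions) and Acrobot-v1 (6-dimensional observation, 3 actions) differ in observation size; the class name `MultiTaskStudent`, the layer sizes, and the random stand-in for teacher outputs are all hypothetical and are not taken from this repository or the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiTaskStudent(nn.Module):
    """Shared trunk with per-task input encoders and output heads (illustrative)."""

    def __init__(self, task_specs, hidden=64):
        super().__init__()
        # task_specs maps task name -> (observation_dim, number_of_actions)
        self.encoders = nn.ModuleDict(
            {name: nn.Linear(obs_dim, hidden) for name, (obs_dim, _) in task_specs.items()}
        )
        # Parameters in the trunk are shared by every task
        self.trunk = nn.Sequential(nn.ReLU(), nn.Linear(hidden, hidden), nn.ReLU())
        # One small "controller" per task, sized to that task's action set
        self.heads = nn.ModuleDict(
            {name: nn.Linear(hidden, n_act) for name, (_, n_act) in task_specs.items()}
        )

    def forward(self, obs, task):
        # Returns unnormalized action logits for the given task
        return self.heads[task](self.trunk(self.encoders[task](obs)))


if __name__ == "__main__":
    specs = {"CartPole-v1": (4, 2), "Acrobot-v1": (6, 3)}
    student = MultiTaskStudent(specs)

    obs = torch.randn(8, 4)  # batch of CartPole-v1 observations
    # Random stand-in for the teacher's action distribution on the same batch
    teacher_probs = torch.softmax(torch.randn(8, 2), dim=-1)

    # KL(teacher || student): kl_div expects log-probabilities as its first argument
    log_probs = F.log_softmax(student(obs, "CartPole-v1"), dim=-1)
    loss = F.kl_div(log_probs, teacher_probs, reduction="batchmean")
    loss.backward()
    print(loss.item())
```

During training, batches from each teacher would be routed through the matching encoder and head, so only the trunk receives gradients from every task.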

condition of the training termination

Project Page Stable Baselines3

Hello,

nice project =)

We would also be interested if you could open a pull request on stable-baselines3 to add your project to the documentation (project section) ;)