Tensorflow implementation of TRPO(Trust Region Policy Optimization) with GAE(Generalized Advantage Estimator) on mujoco
terrisgo / trpo-gae Goto Github PK
View Code? Open in Web Editor NEWThis project forked from yjhong89/trpo-gae
Trust Region Policy Optimization with Generalized Advantage Estimator