The ddpg from mocr

ddpg's Introduction

This TensorFlow version of DDPG has been written during the master's thesis of Arnaud de Froissard de Broissia, under the supervision of Olivier Sigaud.

It was used to obtain the results described in the following paper: http://arxiv.org/abs/1606.09152

It has been coded under strong time constraints, thus the code is "quick and dirty". Maybe you should consider using more advanced versions of DDPG available on gitHub before using this one.

To install this version of DDPG (two methods):

First method:

1)Clone repository somewhere.

2)add to your .bashrc file : export PYTHONPATH=$PYTHONPATH:(path of the DDPG directory's parent)

Second method:

1)In a terminal type "echo $PYTHONPATH"

2)Clone the repository to the directory indicated by PYTHONPATH

Test if it worked:

1)open a python terminal

2)type :"import DDPG.test.test_mc as mc"

3)[optional] type :"mc.doEp(100)"

ddpg's People

Contributors

Stargazers

Watchers

ddpg's Issues

No module named DDPG

I followed your instructions to install DDPG, but I always get the result of "No module named ddpg"
/home/alex-zhai/Pictures/Screenshot from 2016-05-30 19:35:06.png

Can you tell me your installtion？ Thank you!!

Hey - I was looking through your code to try to implement some things from here, but I couldn't figure out what you mean by the action_bounds parameter. I'm trying to implement it in an environment where there are 11 possible actions [0:10]. Any suggestions?

DDPG Actor output saturate

Hello~ I have some question about DDPG
Ｗhen my action dimension = 1, the result is good, but when my action dimension = 2 (the activation function is tanh and sigmoid), the output of actor will saturate.
Here is the result what I said: https://github.com/m5823779/DDPG
By the way, I use batch normalization only in my actor network.
Do you know where is the problem?

Recommend Projects

mocr / ddpg Goto Github PK

ddpg's Introduction

ddpg's People

Contributors

Stargazers

Watchers

Forkers

ddpg's Issues

No module named DDPG

action bounds

DDPG Actor output saturate

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent