Giter VIP home page Giter VIP logo

ddpg's Introduction

This TensorFlow version of DDPG has been written during the master's thesis of Arnaud de Froissard de Broissia, under the supervision of Olivier Sigaud.

It was used to obtain the results described in the following paper: http://arxiv.org/abs/1606.09152

It has been coded under strong time constraints, thus the code is "quick and dirty". Maybe you should consider using more advanced versions of DDPG available on gitHub before using this one.

To install this version of DDPG (two methods):

First method:

1)Clone repository somewhere.

2)add to your .bashrc file : export PYTHONPATH=$PYTHONPATH:(path of the DDPG directory's parent)

Second method:

1)In a terminal type "echo $PYTHONPATH"

2)Clone the repository to the directory indicated by PYTHONPATH

Test if it worked:

1)open a python terminal

2)type :"import DDPG.test.test_mc as mc"

3)[optional] type :"mc.doEp(100)"

ddpg's People

Contributors

mocr avatar osigaud avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

ddpg's Issues

No module named DDPG

I followed your instructions to install DDPG, but I always get the result of "No module named ddpg"
/home/alex-zhai/Pictures/Screenshot from 2016-05-30 19:35:06.png

Can you tell me your installtion? Thank you!!

action bounds

Hey - I was looking through your code to try to implement some things from here, but I couldn't figure out what you mean by the action_bounds parameter. I'm trying to implement it in an environment where there are 11 possible actions [0:10]. Any suggestions?

DDPG Actor output saturate

Hello~ I have some question about DDPG
When my action dimension = 1, the result is good, but when my action dimension = 2 (the activation function is tanh and sigmoid), the output of actor will saturate.
Here is the result what I said: https://github.com/m5823779/DDPG
By the way, I use batch normalization only in my actor network.
Do you know where is the problem?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.