Giter VIP home page Giter VIP logo

Comments (5)

CherryPieSexy avatar CherryPieSexy commented on July 28, 2024

Hello!
When defining an actor (which is a neural net model) you should specify it's output size. For predicting actions you have to initialize a probability distribution by specifying it's parameters, for example for Normal distribution it is mu and sigma for each action coordinate. So, nn should produce parameters of double size compared to action dim.

It is true for every continuous distribution I've implemented.
For discrete distributions you should not double action dim.

from imitation_learning.

LongchaoDa avatar LongchaoDa commented on July 28, 2024

It's so kind of you! Thanks for the answering and I saw the dim was truned to real out_put size (for example: in convert_params_beta() you used action_size = params_size//2)) And I got it now! Thanks again for the clear and extinguished work !

from imitation_learning.

LongchaoDa avatar LongchaoDa commented on July 28, 2024

(Btw: Is it convenient for me to have your email? I'm trying to use GAIL with recurrent policy and I want to discuss more details about this, plus by email we could share screenshots I guess its more effective and I really admire your work so I would like to learn from you as a friend! (If you dont want too many to bother you, You may delete the email account comment after get my greeting email?))

from imitation_learning.

CherryPieSexy avatar CherryPieSexy commented on July 28, 2024

Glad to read that you managed to understand.
I have the email [email protected]
And telegram CherryPieHSE
Feel free to reach me.

from imitation_learning.

LongchaoDa avatar LongchaoDa commented on July 28, 2024

Thanks and My Telegram is: danielsmithda
We may add each other both sides as mutual contacts to start a conversation!

from imitation_learning.

Related Issues (3)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.