Giter VIP home page Giter VIP logo

deeprl_algorithms's Introduction

About Deep Reinforcement Learning

强化学习和深度学习的结合产生了一系列重要的算法,本项目将着重参考相关 paper 并尽可能实现相关算法,这里使用的深度学习框架是 Pytorch

1.为什么要自己写这些已有的东西?

尽管在 Github 上可以看到需要基于该框架的实现。自己动手实现,包括调试以及跑实验……,这个完整的过程,会使得对于算法的理解,以及算法的细微末节处可能遇到的问题更清楚, 从宏观上,你仍然可以参考其他人的实现,但永远不可以简单地 copy & paste,学习这个过程,别人永远无法替你完成。当然,你也会遇到一些问题, 亲自解决这些问题使得你今后对于算法的调试有许多帮助。

从头开始的另一个好处,就是你能把握整个知识脉络,由简单到复杂,性能对比,能够做到心中有数。我的实际使用中会涉及到调用相关的算法,但很难受的是,我一上手 就去调用人家写的东西,实验效果不理想,而无从调试,也不知道问题出在哪里。这就是用别人的东西存在的问题,调用高质量的库能够节省时间也能保证质量,但前提是 ————你了解细节,你已经具备基础。

2.算法列表

  1. DQN系列(Naive DQN, Double DQN, Dueling DQN etc.)

  2. Policy Gradient系列(Reinforce, Vanilla PG, TRPO, PPO etc.)

3.实验效果

  • DQNs on CartPole-v0

  • REINFORCE on MountainCar-v0

  • PPO on BipedalWalker-v2:

  • PGs on Mujoco Bechmarks for mujoco

deeprl_algorithms's People

Contributors

ritchiehuang avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.