
openai_lab's Introduction

OpenAI Lab


NOTICE: Please use the next version, SLM-Lab.


OpenAI Lab Documentation


An experimentation framework for Reinforcement Learning using OpenAI Gym, TensorFlow, and Keras.

OpenAI Lab was created to do Reinforcement Learning (RL) like science: theorize, then experiment. It provides an easy interface to OpenAI Gym and Keras, with an automated experimentation and evaluation framework.

Features

  1. A unified RL environment and agent interface using OpenAI Gym, TensorFlow, and Keras, so you can focus on developing the algorithms.
  2. Implementations of core RL algorithms, with reusable modular components for developing deep RL algorithms.
  3. An experimentation framework for running hundreds of hyperparameter optimization trials, with logs, plots, and analytics for testing new RL algorithms. Experimental settings are stored in standardized JSON files for reproducibility and comparison.
  4. Automated analytics of the experiments for evaluating the RL agents and environments, and for helping to pick the best solution.
  5. The Fitness Matrix, a table of the best scores of RL algorithms vs. the environments; useful for research.
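
To make the third feature concrete, here is a minimal sketch of how a JSON experiment setting could be expanded into a grid of trials. The key names (`problem`, `agent`, `param_range`) and the `expand_trials` helper are assumptions made for illustration; they are not taken from the Lab's actual JSON schema or code.

```python
import itertools
import json

# Hypothetical experiment setting. The key names are illustrative
# assumptions, not the Lab's actual JSON schema.
SPEC_JSON = """
{
  "problem": "CartPole-v0",
  "agent": "DQN",
  "param_range": {
    "lr": [0.001, 0.01],
    "gamma": [0.95, 0.99],
    "hidden_units": [16, 32, 64]
  }
}
"""


def expand_trials(spec):
    """Return one parameter dict per point of the Cartesian grid."""
    # Sort the names so the trial order is reproducible across runs.
    names = sorted(spec["param_range"])
    value_lists = [spec["param_range"][name] for name in names]
    return [dict(zip(names, combo)) for combo in itertools.product(*value_lists)]


spec = json.loads(SPEC_JSON)
trials = expand_trials(spec)
print(len(trials), "trials; first trial:", trials[0])
```

Keeping the whole setting in one JSON document means a trial can be rerun or compared later from the file alone, which is the reproducibility point the feature list makes.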

With OpenAI Lab, we can focus on researching the essential elements of reinforcement learning, such as the algorithm, policy, memory, and parameter tuning. It lets us build agents efficiently by combining existing components with implementations of new research ideas. We can then test research hypotheses systematically by running experiments.
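
As a rough illustration of building an agent from separate policy and memory components, the sketch below wires them together against a toy environment. Every class here (`ToyEnv`, `ReplayMemory`, `EpsilonGreedyPolicy`, `Agent`) is hypothetical and written for this example; none of them is the Lab's API. `ToyEnv` only imitates the classic Gym-style `reset`/`step` interface so the example runs without any dependencies.

```python
import random
from collections import deque


class ToyEnv:
    """Tiny stand-in with a Gym-style reset/step interface (not a real Gym env)."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0  # action 1 is always the good one
        done = self.t >= self.horizon
        return self.t, reward, done, {}


class ReplayMemory:
    """Fixed-capacity store of transitions."""

    def __init__(self, capacity=100):
        self.buffer = deque(maxlen=capacity)

    def add(self, transition):
        self.buffer.append(transition)

    def __len__(self):
        return len(self.buffer)


class EpsilonGreedyPolicy:
    """Pick a random action with probability epsilon, else the greedy one."""

    def __init__(self, epsilon, n_actions, rng):
        self.epsilon = epsilon
        self.n_actions = n_actions
        self.rng = rng

    def select(self, action_values):
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        return max(range(self.n_actions), key=lambda a: action_values[a])


class Agent:
    """Composes a policy and a memory; both can be swapped independently."""

    def __init__(self, policy, memory):
        self.policy = policy
        self.memory = memory

    def act(self, state):
        # Placeholder action values; a real agent would query a learned model.
        return self.policy.select([0.0, 1.0])

    def observe(self, state, action, reward, next_state, done):
        self.memory.add((state, action, reward, next_state, done))


def run_episode(env, agent):
    """Run one episode and return the total reward."""
    state = env.reset()
    total, done = 0.0, False
    while not done:
        action = agent.act(state)
        next_state, reward, done, _ = env.step(action)
        agent.observe(state, action, reward, next_state, done)
        state = next_state
        total += reward
    return total


agent = Agent(EpsilonGreedyPolicy(0.0, 2, random.Random(0)), ReplayMemory())
total_reward = run_episode(ToyEnv(horizon=5), agent)
print("total reward:", total_reward, "stored transitions:", len(agent.memory))
```

Because the agent only talks to its policy and memory through `select` and `add`, either component could be replaced (for example, by a prioritized memory) without touching the episode loop.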

Read more about the research problems the Lab addresses in Motivations. Ultimately, the Lab is a generalized framework for doing reinforcement learning, agnostic of OpenAI Gym and Keras. For example, PyTorch-based implementations are on the roadmap.

Implemented Algorithms

A list of the core RL algorithms implemented/planned.

To see their scores against OpenAI Gym environments, go to Fitness Matrix.

algorithm | implementation | eval score (pending)
--- | --- | ---
DQN | DQN | -
Double DQN | DoubleDQN | -
Dueling DQN | - | -
Sarsa | DeepSarsa | -
Off-Policy Sarsa | OffPolicySarsa | -
PER (Prioritized Experience Replay) | PrioritizedExperienceReplay | -
CEM (Cross Entropy Method) | next | -
REINFORCE | - | -
DPG (Deterministic Policy Gradient) off-policy actor-critic | ActorCritic | -
DDPG (Deep-DPG) actor-critic with target networks | DDPG | -
A3C (asynchronous advantage actor-critic) | - | -
Dyna | next | -
TRPO | - | -
Q*(lambda) | - | -
Retrace(lambda) | - | -
Neural Episodic Control (NEC) | - | -
EWC (Elastic Weight Consolidation) | - | -
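
For readers comparing the first two rows, the sketch below shows the standard textbook difference between the DQN and Double DQN bootstrap targets, using plain Python lists in place of network outputs. The function names and sample numbers are made up for illustration; this is not the Lab's `DQN` or `DoubleDQN` code.

```python
def dqn_target(reward, done, gamma, target_q_next):
    """DQN: the target network both selects and evaluates the next action."""
    if done:
        return reward
    return reward + gamma * max(target_q_next)


def double_dqn_target(reward, done, gamma, online_q_next, target_q_next):
    """Double DQN: the online network selects, the target network evaluates."""
    if done:
        return reward
    best_action = max(range(len(online_q_next)), key=lambda a: online_q_next[a])
    return reward + gamma * target_q_next[best_action]


# Illustrative Q-values for the next state, one entry per action.
online_q_next = [1.0, 2.0, 0.5]
target_q_next = [3.0, 1.5, 0.2]

y_dqn = dqn_target(1.0, False, 0.9, target_q_next)
y_double = double_dqn_target(1.0, False, 0.9, online_q_next, target_q_next)
print("DQN target:", y_dqn, "Double DQN target:", y_double)
```

With these numbers the Double DQN target is the smaller of the two, since the action the online network prefers is not the one the target network scores highest. Decoupling selection from evaluation in this way is what Double DQN uses to reduce overestimation.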

Run the Lab

Next, see Installation and jump to Quickstart.


Timelapse of OpenAI Lab, solving CartPole-v0.
